ORF Name 



* 



NTID 



AAID 



— — Score Pr obab ility 
Length Length ' ; 



Protein name 



13793 



Locus Name 



3.2e-24 



Acc# 



Description 



Q48815 



ORF Name 



NTID 



AAID 



"NTT AA 

— .— Score Probability 
Length Length 



ci 408 



Protein name 



9016 



GUT 



72T 



Locus Name 



0.0040 



Acc# 



hypothetical protein apjuust/b 



Description 



E 



ir:£7269b 



B72695 



ORF Name 



Protein name 



NTID 



llllD±Ab..±lJllt). 



T7W 



5017 



NT 



AA 



AAID Length Length 



Score Probability 



TTT 



12340 



Locus Name 



7-.6e-i0fc 



Acc# 



Description 



lsp:HEXA_>Okai 



P49008 



(BETA-NAHA^li!) 



ORF Name 



Protein name 



NTID 



9018 



NT 



AA 



AAID Length Length 



Score Probability 



T7TT 



Locus Name 



1.4e-6i-. 



Acc# 



long-cnain-tatty-acia CoA ixgase 



Description 



jpir:D70386 



D70386 



1001 



# 



NT 



AA 



ORF Name 



NT ID 
TTW1 



AAID 



Length Length 

cms — 



Score Probability 



\b9 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



21&2S£±2..±1...2B£ I 



AAID Length Length 
VUTQ — 



Score Probability 

firr* 



l 2.&e-07 



Protein name 



Description 



Locus Name 



gp:YP102Ktf 



Acc# 
AL031866 



Yersinia pestis 102 Kbases unstable region: trom 1 to 119443. 



NT 



AA 



ORF Name 



2a£maa...Gi..42a i to 



NTID AAID Length Length 

[?02I — 



TFTT 



Score Probability 
T5$ 



1.0e-08 



Protein name 

Description 
FECR PROTEIN 



Locus Name 



Isp : PSCfeJScfiLI 



Acc# 



P23485 



NT 



AA 



ORF Name 



NTID 



21M5£2A...al..£0A I (T^Ju - 



AAID Length Length 
9022 



7TT 



Score Probability 
1225 



1.0e-27 



Protein name 



Locus Name 



conserved hypothetical protein ylJDK 



pir :H69874 



Acc# 



H69874 



Description 



1002 



# 



NT 



AA 



ORF Name 



NTID 



3801 



AAID Length Length 
TO 



"SWT 



Score Probability 




la .le-OS 



Protein name 



Locus Name 



hypothetical protein S110687 



pxr :S7441€> 



ACC# 



S74416 



Description 



NT 



AA 



ORF Name 



NTID AAID Length Length 



[2TT 



X3TT 



Score Probability 
3T7 ' 



2.0e-29 



Protein name 



Locus Name 



N-acetylmuramoyl-L- alanine amidase 



pir:G7044S 



Acc# 



G70445 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



2A±^A0±..al...5.6A I 



9025 



Length Length 



Score Probability 
£T5 



1.3e-l* 



Protein name 
Description 

30S RIB0S0MAL PkOTEIM SIS (B321) 



Locus Name 



sp:RSia BACST 



Acc# 
P10806 



NT 



AA 



ORF Name 



NTID 



M4D.:mo.2...a3...Ai7. i mm 



AAID Length Length 
9026 



Score Probability 




1.0e-08 



Protein name 
Description 

SIALIDASE HifiCttftSOft, (NfiTOAMtNIfcASE) 



Locus Name 



sp : NANH MICVI 



Acc# 



Q02834 



1003 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



ITT7T 



TuTST 



I . ue-103 



Protein name 



Locus Name 



ArgE/DapE/Acyl family protein 



lpir:E75324 



Acc# 



E75324 



Description 



ORF Name 



NTID 



NT AA 

— _ T — Score Probability 
AAID Length Length JL 



TOUT 



Protein name 



hypothetical protein aq__1533 



Description 



Locus Name 



pir :A70433 



6 . ye-08 



Acc# 



A70433 



S3 



ii U 



ORF Name 



NTID 



\14.6A8A11.±±...±0±. I 



Protein name 



acritlavm resistance protein AcrE 



Description 



NT 



AA 



AAID Length Length 
&UT9 — 



Score Probability 
ll.Oe-lS 



TFT 



Locus Name 



pir:A7036l 



Acc# 



A70361 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2.^6.5.aza.7....C3....5.3.4; { 13808 



Protein name 

Description 
KO-HiT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 




Score Probability 
|2.6e-B3 



Locus Name 



hypothetical protein Tki^e>y 



tpir:D72274 



Acc# 



D72274 



Description 



1004 



NT 



AA 



ORF Name 



NTID 



AAID 



'24706575 c2 496 



|y032 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



247A7.2S.7....cl...2.7.&. 



3811 



TOT 



T3W 



1 . 6e-89 



Protein name 



Description 



Locus Name 



gp:PGU60208 



Acc# 



U60208 



Porphyromonas gingival is ortl, ort2 and ort3 genes, complete ccts. 



NT 



AA 



ORF Name 



i±&.oa±&i.±2...±i$. i itsis 



NTID AAID Length Length 

mji — 



7T 



TIT 



Score Probability 
?FI 



0.0011 



Protein name 



Locus Name 



sodium channel protein 



gp :DVU26 718 



ACC# 



TJ26718 



Description 



DrosopJiila virilis sodium channel protein (para) gene, exonsl, 2 , 3 , 4 , and 
optional segment i, partial cds . 



NT 



AA 



ORF Name 



NTID 



2<U3.Ufc.b.ai...t2....15m., 



TS1T 



AAID Length Length 
T31 



Score Probability 
71 



Protein name 



Locus Name 



hypothetical protein BBA32 



pir:H70210 



Acc# 



H70210 



Description 



1005 



NT 



AA 



ORF Name 



Protein name 



NT ID 



r — ^ _ — _ Score Probabi lity 
AAID Length Length ~ x ~ 

W(JJZ — 



0.034 



Locus Name 



cellulose synthase 



Description 



prr: 133714 



Acc# 



139714 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


pib.&3.xs.£.2^cl^3.:/.:L 


.... 3515 | 4037 




714 


1212 


3.2e-123 



Protein name 



rprY protein 



Description 



Locus Name 



pir:S33662 



Acc# 



S33662 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 

^UTS — 



15*5" 



Score Probability 
776 



5.2e-77 



Locus Name 



sp:YOEVJBACSU 



Acc# 



P54462 



Description 

HYPOTHETICAL bl.7 KD PROTEIN IN DWAJ-ftPaU INTEREGENK REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



Zb.6..7A15..7....Cl...3.7.8... 



■my 



~5TFTT 



Length Length 
3T~ 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



2b^.7.b.Uli...al...3.3.3. I HSTS 



NTID AAID Length Length 

9040 



TIT" 



Score Probability 




4 . 8e-03 



Protein name 



Locus Name 



sodium- dependent transporter homolog yocs 



plr :E63902 



Acc# 
E69902 



Description 



1006 



NT 



AA 



ORF Name 



NTID 



AAID 



2626655 C2 462 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TWU2 — 



Score Probability 
— 



2 . 5e-187 



Protein name 



Description 



Locus Name 



sp:MUTB_PORGl 



Acc# 



Q59676 



METHYLMALONYL - COk MOT ASK ALt>HA-SBfitJMIT, (MCM-ALt>HA) 



NT 



AA 



ORF Name 



NTID 



3821 



AAID Length Length 

inn 



Score Probability 
ITT77 — 



1.7e-119 



Protein name 



Locus Name 



nypotnetical protein TM1267 



pir:B7^274 



Acc# 



B72274 



Description 



ORF Name 



NTID 



Protein name 



Description 



AAID 



NT AA 

— — , Score Probability 
Length Length ~ 



1232 



2.5e-125 



Locus Name 



sp:G6PA_BACST 



ACC# 



P13375 



ISOMERASE A) 



1007 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



c2 450 







9045 50 


183 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length Probability 



2&£A£.$.l..±lJ±t).L I 



TTZT 



Protein name 
Descr iption 

(QUEUOSINE BIOSYNTHESIS &R0TEIM OUEA} 



Locus Name 



|sp:QUEA_ECOLI 



Acc# 



P21516 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
ST - 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

* , ™ T — T — ^. Score Probability 
AAID Length Length — -L - 



'2 .le-i26 



Protein name 



Locus Name 



sp : a YK_BACSU 



Acc# 



P37477 



Description 

LYSYL-TRNA SYNTHETASE, {LVSINE--TE>NA LIGASE) (LYSRS) 



1008 



NT 



AA 



ORF Name 



NTID 



AAID 



3S27 



9049 



Length Length 
TUT 



Score Probability 
HI 



0.0014 



Protein name 



Locus Name 



cytocnrome oxidase I 



gp:AF072662 



Acc# 
AF072662 



Description 



Exoneurella eremophila cytochrome oxidase I gene, mitochondrialgene 
encoding mitochondrial protein, partial cds . 



NT 



AA 



ORF Name 



NTID 



31426541 r3 2S2 



AAID Length Length 
552 



3775U 



Score Probability 
TIE 



Protein name 
Description 

HYPO T HE T ICAL £6.3 KB PRO T E I N IN HA02 5 'REGION 



Locus Name 



sp:YHA2JSIKC0 



Acc# 



P35649 



NT 



AA 



ORF Name 



3.1.7.5. 5taaa...ti...2La.. 



NTID AAID Length Length 

msi — 



5F7~ 



T7TJT" 



Score Probability 
TUTS — 



|4.0e-10S 



Protein name 



Description 



Locus Name 



sp:YIDEJIAEIN 



Acc# 



P44472 



HYPOTHETICAL PROTEIN HI063S 



ORF Name 



NTID 



AAID 



NT AA 

— ' — ~ , Score Probability 
Length Length 



3.lS.0.S.5.g..7....c3....S.g.a.. 



3..1e-09 



Protein name 



Locus Name 



probable lipid A biosynthesis acyltransterase 



pxr:H7id54 



ACC# 



H71954 



Description 



1009 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 
— 



Score Probability 



633 



Protein name 



Description 



Locus Name 



Acc# 



(NO- HIT 



NT 



AA 



ORF Name 



NTID 



m&:m7.:/....ci...i£a. i mrz 



AAID Length Length 
— 



354 



Score Probability 
TT2 



o . 00032 



Protein name 



Locus Name 



MutS-liKe protein 



gp : SATKXA 



Acc# 



AJ223480 



Description 



Staphylococcus aureus trxA and uvrC genes and partxal muts and dhsCgenes . 



ORF Name 



NTID 



NT AA 

— , — ■ Score Probability 
AAID Length Length 



ITT" 



o.aois 



Protein name 



Locus Name 



hypothetical protein C56G2.15 



pir:T15873 



Acc# 



T15873 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3.il£5.£12..±3....i0.7. I [T^T 



Length Length 
TT7™ 



Score Probability 
!T75" — 



|2.5e-13 



Protein name 



Locus Name 



probable isomerase 



pir :B70986 



ACC# 



B70986 



Description 



ORF Name 



Protein name 

Description 
MO-HIT 



NTID 



AAID 



NT 



AA 



Length Length 
573 ™ 



Score Probability 



T5T 



Locus Name 



Acc# 



1010 



NT 



AA 



ORF Name 



NTID 



I344110B1 13 300 



13836 



AAID Length Length 



Score Probability 
lb . 3e-07 



Protein name 



Description 



Locus Name 



|gp:PGU60208 



Acc# 
U6 02 08 



Porphyromonas gingival is ortl, or£2 ana or£3 genes, complete cds . 



n 



NT 



AA 



ORF Name 



3441714:2 ci 414 



NTID AAID Length Length 

I50S5 



[7T7 ) 



Score Probability 

[oto 



Protein name 



Locus Name 



sp:M0TAJ?0ft<3I 



Acc# 



Q59677 



Description 

METHYLMALONYL - COA MUTASE BETA-SUBUNIT, (MCB-BETA) 



n 

"J! ESI' 



J!! WSJ, ' 

I; J 



NT 



AA 



ORF Name 



NTID 



13838 



AAID Length Length 




Score Probability 



7T" 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length — -L 



3.&1346.5..7....tl...M 



13839 



TUT 



Protein name 



Locus Name 



hypothetical protein PAB0910 



pir :B75048 



Acc# 



B75048 



Description 



1011 



NT 



AA 



ORF Name 



NTID 



^1^2 c2 530 



AAID Length Length 

— 



Score Probability 

pus 



5 .4e-56 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



|pir:JC&027 



Acc# 
JC6 02 7 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TO" 



795 



Score Probability 
TTT7 



0. 000.81 



Protein name 



Locus Name 



hypothetical protein SCE3 9.3 0 



pir :T36240 



ACC# 



T36240 



Description 



NT 



AA 



ORF Name 



NTID 



iMiO.B.7....c2...4El I \TMZ 



AAID Length Length 

susi — 



1179 



Score Probability 
153 



4.9e-10 



Protein name 



Locus Name 



hypothetical protein cons 



bir:£7405i 



Acc# 



S74051 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — x ~ 



ft0£8^7.7....a2....4£3. I 



1737 



TOT" 



6.8e-84 



Protein name 
Description 

ELONGATION FACTOR G (BP-fl) 



Locus Name 



sp:fiFSjTHffHI 



Acc# 



P13551 



1012 



NT 



AA 



ORF Name 



NTID 



1 407^10 cA 614 



13844 



AAIP Length Length 



11175 



Score Probability 

prs — 



Protein name 



Locus Name 



receptor antxgen ( RagA ) 



tap:Pail3087a 



Acc# 



AJ130872 



Description 



Porphyromonas gingivalis W50 receptor antigen (rag) locus encocLmga major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



AAID 



NT AA 
, — : , — ^. Score Probability 
Length Length — — 



41562 c3 S71 



Protein name 

Description 
UTO^HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



±26£.&.l...al...6.1.1.. 



3846 



Length Length 



Score Probability 
&.6e-22 



Protein name 



Locus Name 



proline/pyrroline-5-carboxylate dehydrogenase 



pir :BV1980 



ACC# 



B71980 



Description 



NT 



AA 



ORF Name 



\&129A2&.±1J1L5. I 



NTID AAID Length Length 




7JT 



Score Probability 
— ~ 



Protein name 



Locus Name 



sp : y j?'H'i' ECOLI 



Description 

HYPOTHETICAL 23.7 KD PROTEIN IN LRHA-ACKA INTkkUHJsIl^ UEGION 



8.4e-22 



Acc# 



P77625 



1013 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


4378530_ti__21 


j 3848 


5070 


485 


1458 458 


2.2e-44 



Protein name 



Locus Name 



Acc# 



probable glycosyi Hydrolase 



pir :T36467 



T36467 



Description 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


iaaaaas^a^aaa 


3845 3071 


171 


516 


96 


0.0057 



Protein name 



Locus Name 



Acc# 



putative outer surface protein 



gp:BMJ80960 



Description 



Borrelia burgdorferi straxn CA12 putative outer membrane protein (ospE) 
gene, complete cds and putative outer surface protein (ospF)gene, partial 
cds . 



NT 



AA 



ORF Name 



NTID 



4£5.1&3.5...±1...10.2 1 fT83U 



AAID Length Length 



Score Probability 
T^5T5 



1.7e-192 



Protein name 



Locus Name 



czrA protein 



gp : PACZR 



ACC# 



Y14018 



Description 

Pseudomonas aeruginosa czrR, czrc, czrB, czrA genes, ORFb andpartiai okjj'6 . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



A&aaaia..±<L.2L7.& I 



3 .4e-78 



Protein name 



Locus Name 



ribonucleoside -diphosphate reductase , large 
chain nrd 



pir:tiby457 



Acc# 



G69457 



Description 



1014 



NT 



AA 



ORF Name 



NT ID 



4704675 ±2 169 



AAID Length Length 
— 



5IT5" 



2718 



Score Probability 
2 . 5e-145 



Protein name 



Locus Name 



4-alpha-glucanotrans±erase homolog T20B5.4 



pir:T00748 



Acc# 



T00748 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



|^S.DA5L3..7....tl...6.5... 



Length Length 




Score Probability 
1.2e-15 



197 



Protein name 



Description 



Locus Name 



|sp:FOLB m BACSU 



Acc# 



P28823 



DIHYDRONEOPTERIN ALDOLASE, (DHNA) 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
— 



Score Probability 
331 



12 .4e-8£ 



Protein name 



Description 



Locus Name 



sp:T0P3_HAEIN 



Acc# 



P43704 



DNA TOPOISOMERASE III, 



NT 



AA 



ORF Name 



NTID 



AAID 



£&&22L&.7...±2L.iaa I P3ss 



Length Length 
TTS3 



T35~ 



Score Probability 
559 



|2.£e~58 



Protein name 



Locus Name 



coproporphyrinogen oxidase, III , 
oxygen- independent hemN 



pir :B69640 



Acc# 



B69640 



Description 



1015 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


48S2'261_c2_466 


(3856 


|9078 


155 | 468 


271 




1.7e-23 


Protein name 








Locus Name 




Acc# 


ribosomai protein L09 


pir:B70475 




B70475 


Description 
















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4.S.3.112L5...X3....Z6.Z 


3857 




334 1005 


403 




1.7e-37 






Protein name 








Locus Name 




Acc# 










sp:GPDA__BAC^U 




P46919 


Description 
















DEPENDENT DIHYDROXYACETONE - PHOS PHATE 


REDUCTASE) 


























ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±9.15.6.15.^1^1 


3858 


9080 


128 3 


87 








Protein name 








Locus Name 




Acc# 


Description 
















NO-HIT 




















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


simfii^ai^atta 


3554 


4081 


261 786 


257 




3.4e-25 


Protein name 








Locus Name 




Acc# 


proJaaJDie reductase 


APE1044 






pir:E72703 




E72703 



Description 



1016 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
522 



173 



Score Probability 
ITS 



LJ.4e-.09 



Protein name 



Locus Name 



unknown 



Acc# 



AF095748 



Description 



Burkhoiaeria cepacia principal sigma tactor (sigA) , ph thai at edi oxygenase 
reductase (ophAl) , putative phthalate permeaseN- terminal region, putative 
phathalate permease C-terminal region (ophD) , 4 , 5-dihydroxyphthalate 
decarboxylase (ophC) , phthalate- inducible quinolinate phosphoribosyl 
transferase (or>hE) , transposase (tnp) , phthalate dihydrodiol dehydrogenase 



Ifl 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



J 13861 



22F 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
TZ1 — ~ 



Protein name 



Locus Name 



translation elongation tactor G 



pir :H72227 



Acc# 



H72227 



Description 



ORF Name 



NTID 



Protein name 



RprX 



Description 



NT 



AA 



AAID Length Length 




5TT 



Score Probability 
2^22 — 



Locus Name 



gp:S59000 



ACC# 



S59000 



1017 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



raw 



Score Probability 
[TuTS 



s.ye-78 



Protein name 



Description 



Locus Name 



sp : DNAA_BAC& U 



Acc# 



P05648 



CHROMOSOMAL REPLICATION INITIATOR PROTEIN DMAA 



ORF Name 



NT ID 



NT AA ^ 
_ — ^. — Score Probability 
AAID Length Length JL 



6447131 ±1 44 



\TTT 



Protein name 



Description 



Locus Name 



sp:UMG_HUMAM 



Acc# 



P13051 



URACIL -1)NA QLYCOii^LAS B PR^CUR^OR, (UDti) 



ORF Name 



NT ID 



AAID 



NT AA 
T — , — _ Score Probability 
Length Length — — — ■ 



4.4e-32 



Protein name 



Locus Name 



methylglyoxal syntnase 



pir :G72284 



Acc# 



G72284 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



fiaiiiii^„.ti...i&a 1 m^r 



TFT" 



Protein name 



Locus Name 



probable RNA polymerase sigma-24 factor 
(rpoE) 



pir:E71368 



Description 



2 . Oe-13 



Acc# 



E71368 



1018 



NT 



AA 



ORF Name 



NTID 



AAID 



16837762 C2 502 



3B6B 



Length Length 
T7T" 



Score Probability 

n ii.ae-i2 — 



nnrr 



Protein name 
Description 

HYPOTHETICAL MOTE IN MJ077S 



Locus Name 



sp:Y77tt_METJA 



Acc# 
Q58188 



NT 



AA 



ORF Name 



NTID 



13869 



AAID Length Length 
BTJ5I — 



T7TT 



Score Probability 
|i.6e-SS 



Protein name 



Description 



Locus Name 



|sp:ASMA__HAEXN 



Acc# 



P44338 



ASPARTATE- -AMMONIA LIPASE, (ASPARAUINE ^YNTHETA^Ii! A) 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
10.047 



Protein name 

Description 
HYPOTHETICAL P&0T21N HI0S04 



Locus Name 



sp:Y804_HAEIN 



Acc# 



P44053 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



11.75LfiO.£3....c3....^6...„. I [3TF7T 



^5" 



2.0e-6a 



Protein name 



Locus Name 



alpha galactosidase precursor 



gp:AP06l33i 



Acc# 



AF061331 



Description 



Saccharopolyspora erythraea alpha galactosidase precursor (melAj gene, 
complete cds . 



1019 



NT 



AA 



ORF Name 



NTID 



AAID 



£3 BO 



JW7T 



Length Length 




TFT 



Score Probability 
£T5 



1.4e~17 



Protein name 



Locus Name 



cyticLine deaminase 



Acc# 
AJ237979 



Description 



Bacillus caidolyticus cdd gene tor cytidme deaminase. 



NT 



AA 



ORF Name 



12773337 cl 77 



NTID AAID Length Length 



Score Probability 
H32 



l.le-06 



Protein name 



Locus Name 



conserved hypothetical protein yknZ 



pir :E6y858 



Acc# 
E69858 



Description 



NT 



AA 



ORF Name 



NTID 



13.BA5.Z1.7....t3....^.7. | 13874 



AAID Length Length 

^rm — 



Score Probability 



FT" 



Protein name 

Description 
BTO-HIT ~~ 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length 

M57 



Score Probability 
|2.Se-37 



WU1 



Protein name 



Locus Name 



unknown 



gp:AF083252 



Acc# 



AF083252 



Description 



Pseudomonas aeruginosa enoyl-CoA hydratase gene, partial cds; 
pilinbiosynthetic protein (fimL) gene, complete ccis; and unknown gene. 



1020 



NT AA 



ORF Name NT ID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 


21665650_t2_54 5876 


9098 213 642 


448 




Protein name 




Locus Name 


Acc# 






sp:YKGB_HAK!W 


P44577 


Description 










HYPOTHETICAL &R0TE1N HI 021 y 








1 


ORF Name NT ID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 


2214404i_r2_55 5877 


9099 287 864 


255 


1.8e-19 


Protein name 




Locus Name 


Acc# 


PoJdR protein 


gp:PPU25l7y2 


AJ251792 


Description 




Pseudomonas putxda pot>R gene 
protein. 


tor pojdR protein 


and poJDA gene rorFOD^a 




ORF Name NT ID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 


12105A±1...q1.^A 5878 


9100 478 1437 


125 


0.00026 


Protein name 




Locus Name 


Acc# 


unKnown 


gp:U96771 


U96771 


Description 




Prevotella Joryantu putative 
mannanase genes, complete cds 


polygalacturonase , hi- 1 , 4 - 
• and unknowngenes . 


-endogiucanase, ana 




ORF Name NT ID 


NT 

AAID Length 


AA 

— , Score 
Length 


Probability 


lll$All.±±..±l 3879 


9101 467 1404 


970 


1.4e-97 


Protein name 




Locus Name 


Acc# 






sp:¥K(itJJiOoLl- 


P77212 


Description 










INTERGENIC kUGloN 








i 


1021 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



24095327 c3 106 



'3880 



|3.2e-6I 



Protein name 



Locus Name 



hemaggiut inin 



gp:AF017417 



Acc# 



AF017417 



Description 



Prevoteiia intermedia hemagglutinin (phgj gene, complete cds , 



ORF Name 



NTID 



NT AA 

— , — , , Score Probability 
AAID Length Length JL 



24642687 cl 66 



fTuTT 



|9.5e-53 



Protein name 



Locus Name 



sp:DCHSjSLC*E 



Acc# 



P04194 



Description 

HISTIDINE DECARBOXYLASE PROENZYME PRECURSOR, (PI CHAIN) 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



W7T 



Protein name 
Description 

ro^rrrT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Z5333.Q.8....a3...XQ3. | 



yios 



Length Length 
W1W 



11293 



Score Probability 
|4.2e-ll 



132 



Protein name 



Locus Name 



YvrN protein 



gp:BS4^KBDNA 



Acc# 



AJ223978 



Description 

Bacillus subtilis 42.7kB DNA tragment from yvsA to yvqA. 



1022 



• 



ORF Name 



26593051 c3 108 



NTID 



NT AA n , , 
— , — , Score Probability 
AAID Length Length — JL 



1.7e-ll 



Protein name 



Locus Name 



Acc# 



Hypothetical protein aq_2 94 



pir:H7032S 



H70326 



Description 



ORF Name 



\10.5.DAt52..±2J±b. 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length J - 



2.5e-77 



Locus Name 



putative ABC transporter ATP- binding protein 



gp:SCP55 



Acc# 



AL133424 



Description 



Streptomyces coeiicoior cosmict F56, 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length J - 



16A6.2116..„qL„6A I 11^ 



19108' 



ITT 



ITT" 



Protein name 

Description 
[R^TTTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



IA3.0A26.&...13....61 1 



NTID AAID Length Length 

^Iu"9 — 



TO87 



63 



Score Probability 
72 



0.020 



Protein name 



Locus Name 



ORF MSV147 nypotnetical protein 



gpTSFUSTSSr 



ACC# 



AF063866 



Description 

Melanoplus sanguinipes entomopoxvirus , complete genome . 



1023 



NT 



AA 



ORF Name 



NT ID 



616677 c3 92 



AAID Length Length 
STTD 



or 



Score Probability 
TF3 



3 . oe-10 



Protein name 



Locus Name 



receptor antigen (Rag A J 



|gp:PSIi3oa7^ 



Acc# 



AJ130872 



Description 



Porpnyromonas gingivaiis W50 receptor antigen (rag J locus encodinga major 
immunodominant 55kDa antigen. 



NT 



ORF Name 



NTID 



AAID 



$75050 cl 7S 



Length Length 



AA 

— , Score Probability 



Protein name 

Description 
[NO-HIT — : 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



Protein name 



NTID 



3890 



AAID Length Length 



Score Probability 




8 . Oe-23 



Locus Name 



endo-Joeta-galactosidase 



gp:AF083896 



Acc# 



AF083896 



Description 



Flavoioacterium keratolyticus endo-Joeta-galactosidase gene/ completecds . 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length J - 



11$..7.5.3A7....g3,..1M I 



IT 



T71e=TT 



Protein name 



Locus Name 



ruJoredoxm 



Acc# 



H72348 



Description 



1024 



ORF Name 



14354208 c3 195 



Protein name 



NTID 



NT AA 
, — ^ T — ^ Score Probabilit y 
AAID Length Length z - 



TIT 



TUT 



Locus Name 



Acc# 



Description 
INFO-HIT 



ORF Name 



Protein name 



NTID 



'i5L.7.iz&a3L...ta...aa f 



NT 



AA 



T — T — _ Score Probability 
AAID Length Length JL 

3TTS — 



Locus Name 



Acc# 



Description 
NO-HIT " 



ORF Name 



Protein name 



NTID 



AAID 



Trnr 



NT AA 
„ — — ^ Score Probability 
Length Length — j£ - 



7TT 



2TX 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length — JL 



is.ai4D.77...±i,.ai I mws 



WTTT 



ITT 



I4.3e-10 



Protein name 



Locus Name 



ORFS 



gp:D78257 



Acc# 
D78257 



Description 



Enterococcus taecaiis piasmicl pYH7 genes tor BacA, BacB, ORi J, 3 / ORF4 / ORF5, 
ORF6, ORF7, ORF8 , ORF9, ORF10, ORF11 , partial cds , 



1025 



NT 



AA 



ORF Name 



NT ID 



c2 161 



AAID Length Length 

sirs — 



Score Probability 
TT2 



.2.4e-05 



Protein name 



Locus Name 



unknown 



gp:AF116463 



Acc# 



AF116463 



Description 



Streptomyces lincolnensis putative regulatory protein WdlA (wdlA)gene, 
complete cds; and unknown gene. 



NT 



AA 



ORF Name 



21756552 t2 67 



NT ID AAID Length Length 
Ml 1 \5TT9 1 |2T7 1 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



Iii22am...c2...nci.. I 



Length Length 
5TS 



Score Probability 




|5.3e-30 



Protein name 



Locus Name 



hypothetical protein slrl534 



pir:S75853 



Acc# 



S75855 



Description 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1116.0.9.D.Q....C1..124, I 



wrnr 



Protein name 



Locus Name 



Acc# 



Description 
WO-HIT 



1026 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length A - 



23446055 ci 116' 



Protein name 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length J - 



fcimm..±JL.42A I {TWT 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



;i" i: 
as? 


ORF Name NTID 


AAID Length 


f 


2L42mfttt...a2...1t 3... i 9 0 2 


9124 1S& | 




Protein name 




i y 

h 

■sj ;i;;r 


phosphor lJDOsylaminoimxclazole 
(pure) PAB1077 


carboxylase 


;!!) .nsi. 


Description 




ass; 


ORF Name NTID 


NT 

AAID Length 


if Hi 

%! iiW 


2M0..7SJ.£)...±1...S. 3S03 




3 :i 
'it 


Protein name 





AA 



5.8e-46 



Locus Name 



pir :B75013 



Acc# 



B75013 



AA 



Score Probability 



W7T 



Locus Name 



Acc# 



Description 
[NO-HIT 



1027 



NT 



AA 



ORF Name 



NT ID 



AAID 



244093S3 rl 31 



9126 



Length Length 



Score Probability 
±.4e-I9 



2T¥ 



Protein name 



Locus Name 



Acc# 



hypothetical 2 3 . bK protein (ginA-tctfiE 
intergenic region) .-hypothetical protein o2 06 



pir :S4082S? 



Description 



NT 



AA 



ORF Name 



NT ID 



2A6.16.9.3,...tl...l8. I 13905 



AAID Length Length 

wm — 



Score Probability 
TT7 



l.Se-13 



Protein name 



Locus Name 



hypothetical protein SC6C5.12C SC6C5.12c 



pir :T35483 



Acc# 



T35483 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



1680 



Score Probability 




4 .3e-121 



Protein name 



Locus Name 



undine kinase-reiated protein 



pir :B72341 



Acc# 



B72341 



Description 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Protein name 



TTZT 



1002 



5TT 



Locus Name 



1.3e-50 



Acc# 



riboflavin Kinase 



pir:D703l3 



Description 



D70313 



ORF Name 



NTID 



NT AA Score 

AAID Length Length - CQre 



\2BM20.10..±1..± 



JUT 



Probability 
|1.2e-27 



Protein name 



Locus Name 



hypothetical protein 



pir :F72424 



Acc# 



F72424 



Description 



1028 



ORF Name 



NT ID 



NT AA 

* S COITG 

AAID Length Length 



cl lib 



3T3T" 



TIT 



Probability 
I ll.ie-15 



Protein name 



Locus Name 



sensor histiaine Kinase 



] [pir:A72i8J " 



Acc# 



A72383 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
— 



Score Probability 
|2.4e-S4 



Protein name 



Description 



Locus Name 



sp:ATcJlJ)l<Jbl 



Acc# 



P54678 



CA TI ON -TRANSPORTIN G M'J^A^K MT1, 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 





Score Probability 
|3.ie-54 



Protein name 

"sensory transduction nistiaine kinase 
;lr2104 rprotein slr2104 : protein slr2104 



Locus Name 



foirtSVblib 



Acc# 



S75136 



Description 



NT 



— score Probability 



ORF Name 



NTID 



l±k\lA±lxb.±±~±l 



AAID Length Length 



273 



Protein name 



Description 
MO-HIT ~~ 



Locus Name 



Acc# 



1029 



ORF Name 



NT ID 



T$TT 



NT AA Sc o r e Probability 

AAID Length Length 

l.le-bO 



Protein name 



amxnopept ida s e 



Locus Name 
gp:AF0410i3" 



Acc# 



AF041033 



Description 



Shigella riexnen ammopeptidase ipeppj gene, compieue c3s~ 



NT 



AA 



ORF Name 



NTID 



32219042_c3_iyb 



AAID Length Length 



Score Probability 



|2.4e-47 



Protein name 



Locus Name 



Acc# 



sp:RP54_A0JeA | P33983 



Description 
RNA R)hVMEkAaiii ril(JMA -b4 FAC'l'uk 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
£3T 



|4.5e-2 3 



Protein name 

"transcription regulator homoiog yovu^ 



Locus Name Acc# 
] ^ ±TiCi> ^ i 1 c69931 



Description 



ORF Name 



NTID 



^ M score Probability 

AAID Length Length 



3Liaa^i^c3i».iaa..- I |3 9i6 i 



fTTTT 



|3.0e-6b 



Protein name 
gcpe protein 



Locus Name 
"I [pir:imoB7 



Acc# 



E72087 



Description 



1030 



NT 



AA 



ORF Name 



NT ID 



24485592 t2 51 



AAID Length Length 



Score Probability 



9139 



RT7TT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



3LSAA12£l...cl.«aa I I^TF 



AAID Length Length 



1170 



Score Probability 
T5TT 



|2.6e-12 



Protein name 



Description 



Locus Name 



Acc# 



spiQPOT HUMAN 



( OLUTAMINYL - TRNA CYCLOTRANS PHRASE) {CLUTAMINYL CYCLASE ) 



ORF Name 



NTID 



NT AA 

„„ Tr ^ ^ — ^ T — ^ n Sc ore Probab ility 
AAID Length Length — JL 



Z6A±1121..±±...lh I \T$T§ 



TMT 



2.7e-£4 



Protein name 



Locus Name 



calcium motive P-type ATPase 



gp:AFl452S2 



Acc# 



AF145282 



Description 



Trichomonas vaginalis calcium motive P-type ATPase (CA-2) gene , partial cds . 



ORF Name 



NTID 



NT AA 

— ■ — , Score Probability 
AAID Length Length JL 



\&ii8±i5....c±..±&o.. . I mru 



2.2e-2i 



Protein name 
Description 

HYPOTHETICAL 17.7 KD PROTEIN SLR1419 



Locus Name 



Acc# 



sp:YB13_SYNY3 | P74523 



1031 



ORF Name 



NTID 



AAID 



Protexn name 



Description 



NT AA score Probability 

Length Length ~~ 

i.0e-30 



T3TT 



Locus Name 



Acc# 



P23884 



GLYCINE (JLEAVAGifcl ri^d' l 'JBM H Pku'lfcillsl 



ORF Name 



■4475705 ri 14 



Protein name 



NTID 



AAID 



5144 



NT 



AA 



Length Length 




Score Probability 



TBTT 



Locus Name 



Acc# 



Description 



'Q-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



T5T 



2 . Oe-ll 



Locus Name 



|sp:Y6lA_Mt!'rjA 



Acc# 



P81310 



HYPOTHET I CAL PkoT EIKI MJ Ob 11.1 



NT 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



5146 



4 . 6e-3b 



Protein name 



hypothetical protexn gcpk! 



Locus Name 
1 |P ir:E7ibbir ~ 



Acc# 



E71562 



Description 



1032 



ORF Name 



9782^28 ±2 b2 



Protein name 

Description 
MO-HIT 



NT 



AA 



NT ID 



AAID 



Length Length 



Score Probability 



pr 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



'aft2afti2L.ti.jia I tsszg 



AAID Length Length 

wr%% — 



Score Probability 
TIT 



Locus Name 



conserved hypothetical integral membrane 
protein TP0771 



bir:H7i283 



Description 



4 .3C-73 



Acc# 



H71283 



NT 



AA 



ORF Name 



NTID 



AAID 



10.MM5.2l...c1..6.12 1 



9149 



Length Length 



Score 



Probability 
1.2e-27 



Protein name 



Description 



Locus Name 



Acc# 



lsp:YlCI_ECOLI | 



HYPOTHETICAL SS . i KD &ROTBIM Itf GLTO-SELC 1NTSRCENTIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



3928 



9150 



Length Length 
TulT 



Score Probability 



TIT 



Protein name 

Description 
lislO-HIT 



Locus Name 



Acc# 



1033 



NT 



AA 



ORF Name 



NTID 



r3 266 



AAID Length Length 



IT 



"ITT 



Score Probability 
8.7e-05 



TUT 



Protein name 



Locus Name 



transposase 



gp:AF038866 



Acc# 



AF038866 



Description 



BacteroicLes tragilis transposon Tn5520 transposase (bipH) andmobilization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 



NTID 



NT AA 

— 1 , — ■ , Score Probability 
AAIP Length Length — JL 



9152 



i.5e-7$ 



Protein name 

Description 
HYPOTHETICAL 43.2 KD PROTEIN SLL0260 



Locus Name 



s^TYSSTTSYNTT 



Acc# 



P74409 



NT 



AA 



ORF Name 



NTID 



AAID 



9153 



Length Length 
T7~ 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— ^ n _ — ^ Scor e Pr obability 
Length Length — ■ — 



xa.7.3.y.3.s..7....r.^....3.3.a i 13932 



2 . le-20 



Protein name 



Locus Name 



transposase 



gp:AF03886S 



Acc# 



AF0388.66 



Description 



Bacteroides tragilis transposon Tn552 0 transposase (bipH) andmobilization 
protein BmpH (bmpH) genes, complete cds. 



1034 



NT 



AA 



ORF Name 



NTID 



AAID 



10739526 Cl 416 



Length Length 
7TT 



Score 



2142 



Probability 
|2.9e-li0 



Protein name 



Locus Name 



alpha -glucosidase 



gp:BTU66«9? 



Acc# 
U66897 



Description 



Bacteroides thetaiotaomicron neopullulanase (susA) andalpha -glucosidase 
(susB) genes, complete cds . 



ORF Name 



NTID 



NT AA 

, T — _ — ^, Scor e Pr obabili ty 
AAID Length Length JL 



10S7S675 tl 20 



9156 



Protein name 



probable . purine NTPase PAB0812 



Description 



T7T" 



Locus Name 



pir :F75103 



|2.4e-0$ 



Acc# 



F75103 



NT 



AA 



ORF Name 



NTID 



AAID 



iiai£iafi...ti...iifi i p^tf 



Length Length 



Score Probability 



Protein name 

Description 
WO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



'm£ftfi..±1...13L ...J 



Length Length 
^TT3 



Score Probability 



169 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1035 



NT 



AA 



ORF Name 



NT ID 



I20943S r2 211 



AAID Length Length 
TT7Z — 



Score Probability 




5 . Oe-47 



Protein name 



Locus Name 



immuno reactive 42JcD antigen PG33 



gp:AF175715 



Acc# 



AF175715 



Description 



Porphyromonas gingivalis strain W50 immunoreactive 42JcD antigenPG33 gene, 



complete cds 



NT 



AA 



ORF Name 



123163$2 c2 353 



NT ID AAID Length Length 




Score Probability 

wn — 



|2-..Be-4B 



Protein name 



Locus Name 



O-acetylhomoserine sultnydryia.se 



pir :D72324 



Acc# 
D72324 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



I2!il6AiflL±3L...2Lai I 1333? 



Length Length 



Score Probability 



Protein name 

Description 
IMO-HIT ~ 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



125.8.2..7.&5....t3....2&l.. 



9162 



Length Length 



Score Probability 



T5T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1036 



NT 



AA 



ORF Name 



NTID 



— — ' , T — ^ Score Probability 
AAID Length Length JL 

stss — 



7W 



4 . 2e-36 



Protein name 



Locus Name 



Acc# 



hypothetical protein F19D11 . 16 hypothetical 
protein F14M4 . 29 :hypothetical protein F14M4.29 



pxr :T02689 



Description 



ORF Name 



NTID 



mmiiLciiii i 



Protein name 



O-acetylhomoserine sulthydrylase 



Description 



NT 



AA 



AAID Length Length 
T5T 



7TT 



Score Probabi lity 
7TB 



4 . 8e-74 



Locus Name 



pir :D72324 



ACC# 



D72324 



NT 



AA 



ORF Name 



NTID 



AAID 



Iiftisai7...±i.„ii5 1 fT$n 



Length Length 
ST" 



Score Probability 



or 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



^A0A111B...±2..20.1 1 



9166 



Length Length 
T31 



Score Probability 
1.9e-08 



TT5 



Protein name 



Locus Name 



hypothetical protean PH1147 



pir :E71056 



Acc# 



E71056 



Description 



1037 



NT 



AA 



ORF Name 



NTID 



AAID 



14155287 £3 255 



3945 



SIFT 



Length Length 
RTT5" 



T3TF" 



Score Probability 

i^s 



4 .4e-09 



Protein name 



Description 



Locus Name 



|gp:J?GU5020B 



Acc# 



U60208 



Porphyromonas gingivalis ortl, orf2 and orf3 genes, complete cds . 



NT 



AA 



ORF Name 



14453437 c3 5$2 



NTID AAID Length Length 

— 



Protein name 



hypothetical protein ycgF 



Description 



Score Probability 
2uTT — 



Locus Name 



pir :A69758 



Acc# 



A69758 



ORF Name 



NTID 



NT AA 

_ TT> T — ^ T — ^ Score Probability 
AAID Length Length 



IA.7.45112..±3L.2fta I |35?7 



TUT" 



10.013 



Protein name 



Locus Name 



NADH dehydrogenase summit 2 



gp:AF160864 



Acc# 
AF160864 



Description 



Tetrahymena pyritormis mitochondrial DNA, complete genome. 



ORF Name 



NTID 



NT AA 

_ tt~\ t — ^ x — ^ Score Probability 
AAID Length Length — 



l£A7.4ftA2..±:L..iaft I P34S 



TTJS" 



ITT" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1038 



NT 



AA 



ORF Name 



15631502 tl 99 



NT ID AAID Length Length 

9T7T — 



TTZT 



Score Probability 
TOTS 



1.2e-33 



Protein name 



Description 



Locus Name 



sp:YHCG_ECOLT 



Acc# 



P45423 



HYPOTHETICAL 43.3 KD 1M GLTS'-NANT IMTfiftflElrfie REG10W 10375) 



NT 



AA 



ORF Name 



NT ID 



157077*58 c5 535 



AAID Length Length 
9172 



Score Probability 
3¥5 



l.le-35 



Protein name 



Locus Name 



transposase 



gp:AF038866 



Acc# 



AF038866 



Description 



Bacteroides tragiiis transposon Tn552U transposase (bipH) andmobiiization 
protein BmpH (bmpH) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T5~ 



Score Probability 



Protein name 

Description 
(NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length A ~ 



ifimafift...ci...455 1 ftsf? 



3.4e-16 



Protein name 
Description 

ACETYL ESTERASE, ( ACETYLS KLuaibAriU) 



Locus Name 



sp:XyNC_CALSA 



Acc# 



P23553 



1039 



ORF Name 



16594202 rl 73 



Protein name 

Description 
IMO-HIT 



NT 



AA 



NT ID 



AAID 



TOT 



Length Length 



Score Probability 



T3T 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



hypotnetical protein 



Description 



NT AA , , . , . 
— ^ — _ Score Probability 
Length Length ^ 



1723 



2.3e-i77 



Locus Name 



pir : JQ1020 



Acc# 



JQ102 0 



NT 



AA 



ORF Name 



NTID 



AAID 



lftm&..±i...23.1 1 



Length Length 



Score Probability 
2313 



6. 5?e-240 



Protein name 



Description 



Locus Name 



sp:PFL_CLOPA 



Acc# 
Q46266 



FORMATE ACETYLTRAMSPHftASE, { PYRUVATE F6&MATE- LYASE) 



NT 



AA 



ORF Name 



I^££.2...a3....£.u5. I 



NTID AAID Length Length 




TTTT 



Score Probability 
FF7 



5.9e-55 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



|pir:J«027 



Acc# 



JC6027 



Description 



1040 



ORF Name 



19617^ cJ 559 



Protein name 



NT ID 



AAID 



9179 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
INO-HIT 



ORF Name 



NT ID 



NT AA 
T ~\ , — ^, S core Probabi lity 
AAID Length Length — JL 



±9.1D£.2$.B....al...5Al I 



wnnr 



4 . Oe-09 



Protein name 



Locus Name 



transposase 



gp:AP038866 



Acc# 



AF038866 



Description 



Bacteroides fragilis transposon Tn5520 transposase (bipH) anclmob ilization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 



NTID 



i5.a.7.ai.7....ci...3.s.a. 



Protein name 



9T3T 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NTID 



ia0^0xa....al^21.. I 



Protein name 



AAID 



NT AA 
T^t-v, K Score Probability 
Length Length 



TUT 



Locus Name 



Acc# 



Description 
INO-HIT 



1041 



NT 



AA 



ORF Name 



20213303 c2 519 



NT ID 



AAID Length Length 
STSl — 



Score Probability 
TFI 



7.1e-16 



Protein name 



Locus Name 



ATP - dependent activating enzyme 



gp : PFFBSCKAB 



Acc# 



Y09356 



Description 



Pseudomonas tluorescens tbsC, f bsE, ±bsA and tnsB genes. 



NT 



AA 



ORF Name 



NTID 



AAID 



9184 



Length Length 
715 



Score Probability 



Protein name 

Description 
M6-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
T — T — _ Score Probability 
AAID Length Length A - 



13553 



1065 



3 . Oe-10 



Protein name 



Locus Name 



transmembrane sensor 



gp:AP05163l 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A (pstA) , ECF sigma tactor ( HuIJ , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



'mSMiiLtUM. I 



Length Length 



Score Probability 
FUI 



1.2e-79 



Protein name 

Description 
MUTS2 ftftOTEIM- 



Locus Name 



Acc# 



P94545 



1042 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3.9e-"51 



Protein name 



Locus Name 



amidophosphoribosyl transferase 



pir:H6S>185 



Acc# 



H69185 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



2.12L52..7..7....tl...lL3 t I 13966 



Length Length 
— 



Score Probability 
— 



'S.4e-272 



Protein name 



Locus Name 



alJtyl hydroperoxide reductase subunit F 



gp:AF129406 



ACC# 
AF129406 



Description 



Bacteroides iragilis aikyl hydroperoxide reductase subunit C (ahpo and 
alkyl hydroperoxide reductase subunit F (ahpF) genes, completecds. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1±&6&11L±±...±± I PW7 



3.ie-38 



Protein name 



Locus Name 



Acc# 



transcription regulator yggG 



pir:G65078 



Description 



ORF Name 



NTID 



21£.0.b.28.S...±2...13.i I 



Protein name 



hypothetical protein b2228 



Description 



NT 



AA 



AAID Length Length 
9190 



Score Probability 
93 



Locus Name 



pir ;B64993 



0 . 0019 



ACC# 



B64993 



1043 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



Score Probability 



Protein name 

Description 
IMO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



2.1B.7.B.0.S....C2...SD.7. I 



AAID Length Length 
Wim 



FT 



Score Probability 
73 ~ 



'0,019™ 



Protein name 



Locus Name 



putative transmembrane protein 



gp:SCU56107 



Acc# 



U96107 



Description 



Staphylococcus carnosus N5 ,N10-methylenetetrahyclromethanopterinreductase 
homolog, SceB precursor (sceB) and putative transmembraneprotein genes, 
complete cds, and putative Na+/H+ antiporter NhaC(nhaC) gene, partial cds. 



ORF Name 



NT ID 



AAID 



NT AA 

— , „ — - . Score Probability 
Length Length — — — —. 



iiCL44ai.±i...iai I pttt 



TIT 



WT 



0.64S 



Protein name 



Descri ption 



Locus Name 



sp:PRIM_LISMO 



ACC# 



P47762 



DNA PRIMASE, 



ORF Name 



NT ID 



AAID 



NT AA 
„ — ^, T — . Score Probability 
Length Length 



212,&$.a.6±.±l...±±2 1 KTTZ 



TOT 



TuT" 



321 



Protein name 

Description 
IKO-HIT 



Locus Name 



Acc# 



1044 



ORF Name 



Protein name 



NT ID 



AAID 



JTTT 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



\ii&M&2&.±2...ibA I rrrrz 



Protein name 



NT 



AA 



Length Length 
72— 



Score Probability 



TTT 



Locus Name 



Acc# 



Description 



% il!S' 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



a2a&am..±i...itt£ i mrs 



9197 



FT" 



0.031 



Protein name 



Description 



Locus Name 



sp:SPRCXENLA 



Acc# 



P36378 



(OS T EONEC TI N) (ON) (BASEMEN T MEMBRANE PROTEIN BM-40) 



ORF Name 



NTID 



NT AA 

— — , Score Probabi lity 
AAID Length Length 



aia2Laaii...ci„A4i I 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



1045 



NT 



AA 



ORF Name 



NTID 



AAID 



23445317 Cl 458 



\IWTT 



Length Length 
TTTT 



Score Probability 
TZ1 



3.5e-7i 



Protein name 



Locus Name 



conserved hypothetical protein BB0682 



pir :A70185 



Acc# 



A70185 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



^^Uy.5.X.7„,.C3....6.13. I 13978 



Length Length 



TU7T 



Score Probability 

*m — 



1.3e-48 



Protein name 



Description 



Locus Name 



gp:A00047 



Acc# 



A00047 



E.coli mor gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



21&l&10±.,.cl..£M I PT7^ 



Length Length 
SOT" 



WIT 



Score Probability 

in — " 



|3.4e-06 



Protein name 



Locus Name 



AmpG- signal transducer 



'gp:ECAMPG3 



Acc# 



X82159 



Description 
E.coli ampG3 gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



116A&6.1S....CX...16A I 



Length Length 




Score Probability 

iui — 



Protein name 



Locus Name 



hypothetical protein A2 08R 



pir :T17698 



Acc# 



T17698 



Description 



1046 



NT 



AA 



ORF Name 



NT ID 



^78150 c4 



SWF 



AAID Length Length 
1599 



Score Probability 
TTT2 — 



Protein name 



Description 



Locus Name 



sp:RF3_EC0LI 



ACC# 
P33998 



PEPTIDE CHAIN RELEASE FACTOR 5 



NT 



AA 



ORF Name 



NT ID 



AAID 



3982 



Length Length 



Score Probability 



T7T 



Protein name 

Description 
INO-HIT — 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



ll&9.116.1...c±JX$A I 



AAID Length Length 

wars — 



WIT 



1254 



Score Probability 
TI$% — 



Protein name 



Description 



Locus Name 



sp:PEPT_BACSU 



Acc# 



P55179 



PEPTIDASE T, { AMIN0TRIPEPTIDA5E ) (TklPEPTIDASE) 



NT 



AA 



ORF Name 



NTID 



2iri9£ll...a±JX\15. 



13984 



AAID Length Length 



1020 



Score Probability 
1381 



4 . Oe-141 



Protein name 



Locus Name 



class A beta- lactamase CFXA2 precursor 



gp:AF118110 



Acc# 
AF118110 



Description 



Prevotelia intermedia class A beta- lactamase CFXA2 precursor (ctxA2 } gene, 
complete cds . 



1047 



NT 



AA 



ORF Name 



NTID 



AAID 



124397187 c3 5£2 



Length Length 



Score Probability 



Protein name 

Description 
IMO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



|M41Au2£..±2...I5.u.. 



AAID Length Length 



Score Probability 
~ 



4.1e-36 



Protein name 



Locus Name 



Dps 



gp:AB0U5779 



Acc# 
AB025779 



Description 



Porphyromonas gingival is gene tor Dps, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



1ZT 



Score Probability 



1.9e-43 



Protein name 



Locus Name 



Acc# 



sp : YBAL_EC0L1C | 



Description 

HYPO T HETICAL 59.4 KB PROTEIN I N flSK- F Sk IN T HRQ B NIC REGION 



ORF Name 



NTID 



NT AA 

— , *r — 1 Score Probability 
AAID Length Length 



13988 



TUT 



TEW 



7,le-10 



Protein name 



Locus Name 



nucleotide pyrophosphatase homolog T16L4.210 



pir :T09933 



Acc# 



T09933 



Description 



1048 



ORF Name 



NTID 



AAID 



— , — , Score Probability 
Length Length J ~ 



24782732 c3 60S 



S2TT 



1188 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



\2&&AA&±l...cl..£Al 



WITT 



7TF 



T7T 



3.2e-05 



Protein name 



Description 



Locus Name 



Acc# 



sp : EBA2_FLAME "j P36912 



( ENDOGLYCOS IDA5E P2) 



NT 



AA 



ORF Name 



NTID 



24ft!i3.m...tl..M I P^T 



AAID Length Length 



Score Probability 
52B 



?.8e-51' 



Protein name 



divalent cation transport -related protein 



Locus Name 
[pir:H723£3 



Acc# 



H72360 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



^XT- 



Length Length 



Score Probability 



F7T 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



1049 



NT 



AA 



ORF Name 



NTID 



AAID 



|24&4a928 cl 459 



9215 



Length Length 
[2uT 



Score Probability 



IBP 



Protein name 

Description 
KO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TU 1 — 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



1 



AAID Length Length 
5217 



RT2T 



TTTT 



Score Probability 
T7Z ' 



Protein name 



Locus Name 



Acc# 



sp:YIDA_ECOLT 



Description 

HYPOTHETICAL 29.7 KD PROTEIN IN IBPA-GYRB INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



253,9.8.3.8.5...a2L...&6.1 1 13 996 



9218 



Length Length 



1041 



Score Probability 
" 



Protein name 



Locus Name 



hypothetical protein F14F9.5 



pir:T33774 



Acc# 



T33774 



Description 



1050 



NT 



AA 



ORF Name 



NTID 



AAID 



Iiib422i62 tl B 



Length Length 
fT7W 



Score Probability 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
75 



Score Probability 
73 



0.024 



Protein name 



Locus Name 



HCG-1 protein 



gp:AP044219 



Acc# 



AF044219 



Description 
Drosophila melanogaster HCG-1 protein 



(HCG-1) mRNA, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



9221 



Length Length 
TTT~ 



Score Probability 
T73 



4.1e-13 



Protein name 



Locus Name 



thiorecloxin-like protein 



|gp:ATAOTl071S 



Acc# 
AC010718 



Description 



Arabxclopsis thaliana cJiromosome I BAC 
sequence . 



F28016 genomic sequence , complete 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — — — — 



25.a9.21&:/....CU...5.9.& 1 14000 



WTTT 



1.3e-S7 



Protein name 



Locus Name 



probable cLTDP-L-rhamnose synthase 



pir :T31087 



Acc# 



T31087 



Description 



1051 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2635963b_cl_4uy 


| 4001 


9223 


373 


1122 


280 


1 . >C Zi *± 


Protein name 








Locus 


Name 


Acc# 










sp:ENTcJ 


_EC0L1 


P10377 


Description 














"ISOCHGftiaMA'rtl 


SYNTHASE is!JMT<J , 










i 


ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 
6.Se-43 




|4002 


9224 


579 


1740 


454 





Protein name 



Description 



Locus Name 
|sp:MEMD__HAJbllJsl 



Acc# 



P44612 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score 



9225 



T5T" 



11056 



Probability 
, 7.6e-6"C 



Protein name 

"probable zinc-contammg aenyarogenase 



Locus Name 



1 [pir:Tafeyfer 



Acc# 



T36961 



Description 



ORF Name NTID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 
1.5e-0b 


Z55ITD.12L.7^ci^iaj. 92 2 6 


145 


428 111 




Protein name 




Locus Name 


Acc# 


terric uptake regulation protein 




] pir:G722li 


G72213 



Description 



1052 



NT 



AA 



ORF Name 



NTID 



AAID 



25501577 cl 412 



WITT 



Length Length 
T71T 



Score Probability 
|5.Ie-102 



TTITT 



Protein name 



Locus Name 



naphtnoate synthase, menB : DHNA 
synthase : dihydroxynaphthoate 
synthase idihydroxynapthoic acid synthetase 



pir :F65*656 



Description 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



25504557 t2 133 



Length Length 
TIT" 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



ffuTTT 



AAID Length Length 
|4§5 



Score Probability 
TIW 



Protein name 



Locus Name 



transposase 



gp:AF03SS55 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn5520 transposase (bipH) andmobi l ization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 



NTID 



AAID 



NT AA 

— ^ y — . Score Probability 
Length Length 



14008 



TTT 



33b 



9.2e-07 



Protein name 



Locus Name 



Hypl protein 



IgprHVHYPlPRO 



Acc# 



Y09797 



Description 
H. vulgaris mRNA tor Hypl protein. 



1053 



NT 



AA 



ORF Name 



NTID 



\%TJUTT 



AAID Length Length 
WIT1 — 



Score Probability 
W5 



0.0053 



Protein name 



Locus Name 



asparagine-nch protein (clone 28C6) 



pir :S1447tl 



Acc# 
S14470 



Description 



NT 



AA 



ORF Name 



NTID AAID Length Length 

win — 



&ZTT 



Score Probability 
T£Z 



l.Se-31 



Protein name 



Locus Name 



Acc# 



Sensor protein RcsC (EC 2. 7. 3. -J 



gp:D90850 



Description 



E.coli genomic DNA, Kohara clone #373 (49 . 5-49 . 9 min.) 



ORF Name 



NTID 



NT AA 
T — _ — _ Score Probability 
AAID Length Length ^ 



iaiasaAZ...c2L...5i3L& ..i .14 o i i 



WITT 



Protein name 



hypothetical protein 



Description 



WIT 



TTTT 



TIT 



S.Se-25 



Locus Name 



pir :C72285 



Acc# 



C72285 



ORF Name 



NTID 



AAID 



NT AA 

— ^, — ^ Score Probability 
Length Length 



\10&02155...±2..±ll I F0T7 



TUTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
t — ^ T — 1 _ 1 Score Probability 
Length Length 



TTW 



Protein name 

Description 
(NO -HIT : 



Locus Name 



Acc# 



1054 



NT 



— ^ S core Probability 
T37TJT3 



ORF Name 



NT ID 



AAID 



32478803 c2 bb2 



4014 



Length Length 



TTTT 



Protein name 



Description 



Locus Name 
|sp:CPElj40VlW 



Acc# 



018963 



eVTOCHkoMM £450 2tiX, (CV^ll^l) 



NT 



— Score Probability 



ORF Name 



NTID 



AAID 



32523b7fo ti 



WITT 



Length Length 



FT 



Protein name 
Description 



Locus Name 



Acc# 



WfO-HIT 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


ll±$A±l...ai...6.2& 


4016 


5238 


|173 


522 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


mi0.^6...±1....4b. 


....I 4017 


| 5239 


214 


545 | 


163 


- 4.7e-l2 | 



Protein name 



Locus Name 



"RNA polymerase sig ma tactor Sigz-nke protein | j gp : afijv/ST 



Acc# 



AF137263 



Description 

Bacteroides thetaiotaomicron 3 us riposomai protein 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . ^ 



SIS - likeprotein, tucose 



1055 



ORF Name 



33400263 cl 423 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



1017'' 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NTID 



imafiftai.±i...aaft j 



Protein name 



NBUl mobilization protein mob 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



I20S4 



Locus Name 



pir :A49901 



|1.3e-2i5 



Acc# 



A4 99-01- 



ORF Name 



Protein name 



NTID 



M0.2£5.5.8...±l...:5.6. I |37T2u" 



AAID 



NT 



AA 



Length Length 




Score Probability 



55 



Locus Name 



Acc# 



Description 
INC -HIT 



ORF Name 



NTID 



MM.M.D.U...c1..A2l2l i FTTTI 



Protein name 



AAID 



NT AA 

— L1 Score Probability 
Length Length 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



hypotnetical protein PAB0U40 



Description 



NT AA 

— , — , Score Probability 
Length Length 



T2T 



8 . 6e-20 



Locus Name 



pir :B75194 



Acc# 



B75194 



1056 



NT 



AA 



ORF Name 



NTID 



AAID 



35161302 c2 524 



14023 



Length Length 
[2TT" 



or 



Score Probability 
TFS 



II. 0e-10 



Protein name 

Description 
HYPOTHETICAL PkOf&Wt HI0350 (Oftrt) 



Locus Name 



sp:Y350_HAEIN 



Acc# 
P24326 



NT 



AA 



ORF Name 



NTID 



36073551 cl 389 



AAID Length Length 




3US" 



Score Probability 




|3.§e-33 



Protein name 



Description 



Locus Name 



sp:YZ0$JffCTU 



Acc# 



Q10543 



HYPOTHETICAL TRNA/RRNA METHYLTJJAWSFERASE CY31.09, 



NT 



AA 



ORF Name 



NTID 



AAID 



3.6.2.D..7.a3.3....al...^l3. | 



9247 



Length Length 
^5" 



Score Probability 
TT5 



l.le-iO 



Protein name 



Locus Name 



chloromuconate cycloisomerase homolog yJctB 



pxr:H«955 



Acc# 



H69855 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



16.16±0A1..±1...16.1 J WTTTZ 



Length Length 



Score Probabi lity 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1057 



NT 



AA 



ORF Name 



:J94468ti t3 344 



WUT7 



NT ID AAID Length Length 

— 



Score Probability 
6 . Oe-99 



Protein name 



Locus Name 



aiJcyl hydroperoxide reductase subunit <J 



gp:AF12940fe 



Acc# 



AF129406 



Description 



Bacteroides tragilis alky I hydroperoxide reductase subunit C (ahpC)and 
alkyl hydroperoxide reductase subunit F (ahpF) genes, completecds. 



NT 



AA 



ORF Name 



NTID 



5547562 c2 534 



14028 



AAID Length Length 

vz5§ — 



1203 



Score Probability 
|4.2e-75 



Protein name 



Locus Name 



transposase 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn5520 transposase (£>ipHj andmomiization 
protein BmpH (bmpH) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TTT8 — 



Score Probability 



TIE 



Protein name 



Locus Name 



hypothetical protein PH0922 



pir :D/1082 



Acc# 



D71082 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



40.MS.8.7....C.1...AII I WUJU 



9252 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



iwo-Hrr 



1058 



NT 



AA 



ORF Name 



NT ID 



14094400 t3 ^44 



AAID Length Length 



Score Probability 
?5S — 



|4.8e-83 



Protein name 



Description 



Locus Name 



sp : SDHLjSTRCO 



Acc# 



086564 



L-SfiftltiB OmDftM ' AaE, (L-&&'klKffi DEAMINASE) (SDH) (L-SS) 



ORF Name 



NT ID 



NT AA 
T — _ — _ Score Probability 
AAID Length Length ^ 



45<J4«2 cl 402 



4032 



|1.4e-05 



Protein name 



Locus Name 



intracellular Hyaluronic acici binding protein 



gp:AF032S<52 



Acc# 
AF032862 



Description 



Homo sapiens intracellular hyaluronic acid, binding protein (IHABPJ mRNA, 
complete cds . 



ORF Name 



NT ID 



I4m&2£...c2...5aa I mrn 



Protein name 



NT AA 

— ■ — , Score Probability 
AAID Length Length J - 



1775" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— , - — , Score Probability 
Length Length 



\&8M$2L±1..326. I WUTZ 



T5TT 



5.6e-3i 



Protein name 

Description 
(CHONDROiTINASE) 



Locus Name 



'sp:<3A6SJK»JMT 



Acc# 



P34059 



1059 



NT 



AA 



ORF Name 



NT ID 



14939000 c2 525 



37735" 



AAID Length Length 
1335- 



925 7 



TT5TT" 



Score Probability 
TTTT7 — 



iTle-104 



Protein name 



Locus Name 



conserved hypothetical protein 



pir :B72278 



Acc# 



B72278 



Description 



NT 



AA 



ORF Name 



CTHIST7IcX3MZZZ3] 14036 



NT ID AAID Length Length 

mzs — 



Score Probability 

m — 



i.5e-14S 



Protein name 



Locus Name 



cation- transporting atpase, p- type (pacs) 
PAB0626 



pir:EV5141 



Acc# 
E75141 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



isiaaai..ai...ifis i mrr 



Length Length 
T7T~ 



Score Probability 



Protein name 
Description 

p^riTrr 



Locus Name 



Acc# 



ORF Name 



NT ID 



AAID 



NT AA 
* — ^1 — Score Probability 
Length Length 



isims2...ci...fiifi I fsis 



[TTT" 



0.00075 



Protein name 



Locus Name 



hypothetical protein F42G9.3 



pir:T15348 



Acc# 



T16348 



Description 



ORF Name 



NT ID 



AAID 



Protein name 

Description 
EPTO^TTT 



NT AA 

— — Score Probability 
Length Length 4L 



Locus Name 



Acc# 



1060 



NT 



AA 



ORF Name 



5273552 C3 639 



^ mn .^ — , , — _ Score Probability 
NTID AAID Length Length — 





FTuTO" 



[¥3~7~ 



Protein name 

Description 
py^HTT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
_ — ^ T — j n Score Probability 
AAID Length Length ^ 



14041 



TUT" 



0.0047 



Protein name 



Description 



Locus Name 



Acc# 



gp:PMAL3E>7 



Plasmodium raicxparum MAL3P7, complete sequence. 



NT 



AA 



ORF Name 



NTID 



^ ^ _ — ^, _ — Score Probability 
AAID Length Length JL 



£&ma..±1...2£3L I |4MI 



T7T 



Protein name 
Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



6A2.$6Al...al..£±S. I 



NTID AAID Length Length 
13255 — 



Score Probability 
6.5.e-.15 



Protein name 



Locus Name 



sp:RK>£!JlAEIN 



Acc# 



P44790 



Description 

RNA POLYMERASE "SIGMA-E FACTOR (SIGMA- 24] 



1061 



ORF Name 



16444077 tl Al 



Protein name 



Description 



NT 

Length Length 



AA 

— Score 



Probability 
|1.4e-5a 



Locus Name 
sp : PFLAJii(JoLl 



ACTIVATING riN^^MlJ) 



ORF Name 



NT 



AA 



NTID 



AAID 



708467b tl iy 



|404 5" 



Length Length 
— 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



|N< 



©-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



.7.l0.3.40..7....ci....b.B.b.. 



14046" 



i.4e-2l" 



Protein name 



Locus Name 



|sp:^A0j i l<JuLl 



ACC# 



P76243 



Description 
HYPOTHETICAL 14 .'A KB PJkoTKIW IN (iAkA 



-RND lUTUkGENliJ kfckilufl 



ORF Name 



NTID 



AAID 



NT AA Score 

Length Length 



Probability 



3 



1.4e-6S 



Protein name 



Locus Name 
|sp:¥Umj*JuLl 



Acc# 



P33019 



Description 

HYPO T HETICAL ib.s Kb PROTEIN IN LVd^- 



1062 



NT 



AA 



ORF Name 



975780 ±3 333 



NT ID AAID Length Length 

192 70 



T5T 



Score Probability 
0.0034 



ST 



Protein name 



Locus Name 



troponin T, cardiac muscle : troponin T2 



Description 



pirrTPHUTC 



Acc# 



ORF Name 



9.22.7.3.3.a...G3....6.2.7.., 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



mil' 



Length Length 



Score Probability 



Locus Name 



Acc# 



iff 



ORF Name 



aa5A.7.5.7....C3....5.&l.. 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



T89 



Locus Name 



Acc# 



y 



ORF Name 



Protein name 

Description 
T &R0TEM) 



NTID 



AAID 



NT AA 
r ~ , , T — . , Score Probability 
Length Length 



3273 



TUFT 



3 ,8e-.81 



Locus Name 



sp:GCST_BAC!SU 



Acc# 
P54378 



1063 



ORF Name 



\2243U202 ±2 1 



Protein name 

Description 
IKTO-HIT 



NT 



AA 



NT ID 



AAID 



15274 



— ^ — _ Score Probability 

Length Length — 

TT1 

Locus Name Acc# 



7^ 



ORF Name 



Protein name 



NT ID 



AAID 



3215 



inner memJorane protein homolog 



Description 



NT AA 

— _ — ^ Score Probability 
Length Length JL 



BUT 



TOT 



Locus Name 



pir:A70155 



b . 6e-38 



Acc# 
A70155 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length Score PrQb ^bility 




T5T 



12 . 7e-09 



Protein name 



Locus Name 



Acc# 



transcriptional regulator 



gp:BSUB0017 



Description 



Bacillus subtilis complete genome (section xi of 21J : from 3197001to 
3414420. 



NT 



AA 



ORF Name 



NT ID 



imi5H2„tLlfii I 



AAID Length Length 
— 



1ST 



Score Probability 




'2.Se-I8 



Protein name 



Locus Name 



neat shocK: protein, class I 



|pir:D723S5 



Acc# 



D72385 



Description 



1064 



NT 



AA 



ORF Name 



NT ID 



AAID 



13860625 c2 279 



9278 



Length Length 
TTT 



Score Probability 
0.0048 



78 



Protein name 



Locus Name 



conservea hypothetical protein yulD 



pir: P 7 0U14 



Acc# 



F70014 



Description 



NT 



AA 



ORF Name 



NT ID 



\l&S£A0£±...cl..±S£. I 



AAID Length Length 
— 



FOB" 



TTTT 



Score Probability 
7.2e-ii9 



PT7T" 



Protein name 



Locus Name 



coenzyme F390 syntiietase II 



pir;B6^11b 



Acc# 



B69115 



Description 



NT 



AA 



ORF Name 



NTID 



U£±a5&3....al..£&£ I 



AAID Length Length 
1033 I im^ — 



Score Probability 
£56 



l.2e-50 



Protein name 



Locus Name 



sensory transduction histicline Kinase 
slr2098 .-protein slr2098 :protein slr2098 



ir:£75130 



Acc# 



S75130 



Description 



NT 



AA 



ORF Name 



NTID 



14fi&2ia2..±2...aQ I FTT5^ 



AAID Length Length 
9281 



[7TT 



Score Probability 
621 



4 .4e-72 



Protein name 



Tri r 4 allergen 



Locus Name 
|gp:AF0S2514 



Acc# 



AF082514 



Description 

Trichophyton rubrum Tn r 4 allergen mRNA, complete eels . 



1065 



NT 



AA 



ORF Name 



^-r^ * t> -rr^ r — ^ t — , ■» Score Probability 
NT ID AAID Length Length z - 

-57S2 — 



i55TT 



55F" 



T57T" 



1.0e-35 



Protein name 



Locus Name 



|sp:YUiF_BAtt«U 



Acc# 



P94408 



Description 

HYPOTHETICAL 53 .3 £>ft<5TfilM IN SF£>-<3BftKA ItfTfiftGEWIC REGION 



NT 



AA 



ORF Name 



1$704££2 cl 235 



NTID AAID Length Length 

— 



207 



Score Probability 
535~~ — 



5. 9e-53 



Protein name 



Locus Name 



CDP-4 -keto-6 -deoxy-D-glucose- 3-dehydrase 



Description 



pir :E47070 



Acc# 



E47070 



ORF Name 



Protein name 

Description 
IKfO-HIT 



NTID 



NT AA 
T — ^, _ — - Score Probability 
AAID Length Length z - 



l^lAlir.L±l..AA. I [¥uT2 



525T" 



T5T 



(57TT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



ia.7.a5.a^i...az...z3..7. ( 



NTID AAID Length Length 



¥75" 



1431 



Score Probability 
557" 



Protein name 



Locus Name 



'sprYBHFJKIOLl 



Acc# 
P75776 



Description 

HYPOTHETICAL ABC TRANSPORTER ATP-BIWDING PROTEIN YBHF 



1066 



ORF Name 



NTID 



NT AA 

„„„ T — ^ T — Score Probabilit y 
AAID Length Length JL 



1992182 r2 107 



TIT" 



ITT" 



Protein name 

Description 




Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

wzzi — 



W5T 



Score Probability 
51 



0.015 



Protein name 



Description 



Locus Name 



sp:YK5S YEAST 



Acc# 



P36158 



HYPOTHETICAL 65.5 1(E) PROTEIN IN SI52-MTM -INTEftGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



2Q3.21Q15....C2...242.. 



14 OS'S 



Length Length 



Score Probability 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



m&^A..cl..l2lA I ffulTT 



Length Length 



Score Probability 
1.2e-25 



Protein name 



Locus Name 



hypothetical protein C26D10.4 



pir ;T19486 



Acc# 



T19486 



Description 



ORF Name 



NTID 



NT AA 

„ „ ^ — ^. — L1 Score Probability 
AAID Length Length JL 



4068 



Protein name 



Locus Name 



hypothetical protein SC5C7.08 SCbC'7.08 



pir :T35215 



Description 



1.4e-35 



Acc# 



T35215 



1067 



NT 



AA 



ORF Name 



21b22003 C3 318 



NT ID AAID Length Length 

wzm — 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



12&18/2.ai...c±...211 1 F5TT7TT 



-r — ^ x — ^ Score Probabilit y 
NTID AAID Length Length *- 

mwi — 



TTT 



7 .4e-14 



Protein name 



Locus Name 



unKnown 



gp:AF048 74 9 



Acc# 
AF048749 



Description 



Bacteroides fragilis capsular polysaccharide bxosynthesis operon, complete 
sequence * 



ORF Name 



NTID 



NT AA 
T — T — Score Probability 
AAID Length Length 



4071 



TTT5T" 



17TT 



l.4e-44 



Protein name 



Locus Name 



'sp:YBHS_ECOLI 



Acc# 



P75775 



Description 

HYPOTHETICAL 42.1 KD PROTEIN IN MOAE-RHLE INTERGENIC REGION 



ORF Name 



NTID 



AAID 



NT AA 

— , — L1 Score Probability 
Length Length - — -£ - 



2&aiftA5fl...c2L.2afi. I 



1.4e-0S 



Protein name 

Description 




Locus Name 



IsprYHIIJECOLI 



Acc# 



P37626 



1068 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



23459536 t3 160 



mrrr 



TT7 1 |35T 



Protein name 

Description 
MO-HIT ~ 



Locus Name 



Acc# 



NT 



ORF Name 



NTID 



AAID 



14074 



9296 



Length Length 



AA 

— , Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



2.3.5.M0.5.5....cI...I^D. I [TOTS 



AAID Length Length 
TTST 



Score Probability 
10.00025 



Protein name 



Locus Name 



Sp:Y13B_BPT4 



ACC# 



P17308 



Description 

HYPOTHE T ICAL 11.5 KB PRO T EIN IN GP31-CD IKITBRcj^ NIC REGION (ORF B) 



NT 



AA 



ORF Name 



NTID 



AAID 



aifi42m..±i...ii ...J rttt^ 



Length Length 



Score Probability 



TTTT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



1069 



NT 



AA 



ORF Name 



NTID 



AAID 



23926577 cl 214 



14077 



Length Length 



Score Probability 



11458 



Protein name 

Description 
1N0-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



■2L3iaa&a&.7...xi...2^.. 



wunr 



Length Length 



Score Probability 
353 



6.7e-3.6 



Protein name 



Locus Name 



damage -inducible protein PAB02 43 



pir:A75151 



Acc# 



A75151 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
53BT — 



Score Probability 
1284 



7.ie-25 



Protein name 



Locus Name 



hypothetical protein MTH18.54 



lr : A69115 



Acc# 



A69115 



Description 



NT 



AA 



ORF Name 



NTID 



242.5..7.a3..7....al...2.0.i I KOTff 



AAID Length Length 
¥TU2 — 



3^3" 



Score Probability 



IT7 



Protein name 



Locus Name 



hypothetical protein PAB0603 



pir:S75137 



Acc# 



E75137 



Description 



NT 



AA 



ORF Name 



NTID 



2L44flftaast..±a.„i£t I F^r 



AAID Length Length 
93T73 



Score Probability 
2.ie-D6 



T3^ 



Protein name 



conserved hypothetical protein 



Locus Name 
toir :F75328 



Acc# 



F75328 



Description 



1070 



NT 



AA 



ORF Name 



NTID 



AAID 



^4648876 ±2 MfO 



9304 



Length Length 



Score Probability 



7TDF 



Protein name 

Description 
WO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



z4aza3.aa...cz...2L^i i 14083 



Length Length 




Score Probability 
T22 



1.4e-05 



Protein name 



Locus Name 



sp : XERCJSALTY 



Acc# 



P55888 



Description 
| INTEGRATE / ftBCOMBIWASB XERC 



ORF Name 



NTID 



AAID 



NT AA ^ ^ , . . _ , ^ 
— — Score Probability 
Length Length 



5.8e-il2 



Protein name 



Locus Name 



nelicase 



gp : RNDNAB 



Acc# 



Y13813 



Description 
Rhodothermus marinus clnaB gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



2.Stfeai&&2...ci...iaa 1 14085 



Length Length 
TUB" 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1071 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



26574061 ±2 10b 



9 3 C) 8 



7T5" 



Score Probability 
|8.5e-45 



Protein name 



sanA protein 



Locus Name 
] |P ir:iy/bb4 3 



Acc# 



D75549 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



TOT 



Score Probability 
|5.1e-b4 



Protein name 

Na-translocating N ADB-quinone reaucbase, Nqrs 
subunit 



Locus Name 
|pir:A7^9 



Acc# 



A72399 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



261A±0X'2.,..a±^20.0. 



Score Probability 
IS . 4e-272 



Protein name 



Description 



Locus Name 



|sp:tWkA>AcifcfU 



Acc# 



034863 



"EXCIMUCL'EASE AkC SUBUNIT A 



NT 



AA 



ORF Name 



NTID 



{261^^1^1^.21 1 



AAID Length Length 
19311 



Score Probability 



[5W 



^3" 



1.4e-b:4 



Protein name 



Locus Name 



Na- translocating NADH-qumone reauctase, JNqr4 
subunit 



|pir:HVaJ*a 



Acc# 



H72398 



Description 



1072 



ORF Name 



NTID 



2544087 c± 2Ui 



14050 



NT M score Probability 

AAID Length Length 

BZ5 



TTET 



1.3e-18 



Protein name 

"conserved hypothetical protein scezu.jjc. 



Locus Name 



gp:SC!K20 



Acc# 



AL136058 



Description 

Streptomyces coelicoior cosmia n!2u 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3025037__c3_287 


4051 


5313 


705 


2118 


220 


" |5.5e-21 



Protein name 



site-specilic recombmase 



Locus Name 
gp:DS£934 



Acc# 



D86934 



Description 



Staphylococcus aureus genes, mec region, partial ana compieue dag: 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


1119:±±ll...z2JlbA 


.... 4052 


5314 


115 


351 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


lAO/lll&l^cl^ll 


-4053 


9315 


334 


1005 


766 


5.9e-7b 



Protein name 

Ma- translocatin g NADH-qumone reauctase, Mqr^ 
subunit 



Locus Name 
pir :F72^98 



Acc# 



F72398 



Description 



1073 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



34179712 £3 154 



2049 



6 . 2e-200 



Protein name 



Description 



Locus Name 



Acc# 



'sprWRiJJJAUSLI 



EXCMMASfi ABC SUMMIT B (61JMA PftOl'tild) 



NT 



AA 



ORF Name 



NT ID 



AAID 



3534462£ cl 201 



Length Length 



Score Probability 
7^ 



2 .le-3,2 



Protein name 

Description 
EB^C PROTEIN 



Locus Name 



sp:EBSC_EOTFA 



Acc# 



P36922 



NT 



AA 



ORF Name 



36.122U12L...C.2....2L3.9. I 14096 



NTID AAID Length Length 

mr$ — 



TUT 



Score Probability 

\m — 



7.5e-28 



Protein name 



Locus Name 



sp:YBHR_ECOLI 



Acc# 



P75774 



Description 

HYPOTHETICAL 41.5 KD PROTEIN IU MOAE-RHLE IKfTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



3.&.3.5.U8.12L...t.l...2L5.., 



Length Length 



Score Probability 



Protein name 

Description 
IWO-HIT 



Locus Name 



Acc# 



1074 



NT 



AA 



ORF Name 



NTID 



AAID 



14095 



Length Length 



TFFT 



Score Probability 
£T7 



a.le-32 



Protein name 



Locus Name 



conserved hypothetical protein yl£>K 



pir :H69874 



Acc# 



H69874 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



lMb.mi.±l...lll I 



W2T 



5¥2 ) V£UT5 



4,8e-161 



Protein name 



Description 



Locus Name 



Isp : J>YRG_UACSU 



Acc# 



P13242 



CTJ> tJVNTHAyiil, (UTJJ- -AMMONIA LK3A&E) (OTP SYNTHETASE) 



ORF Name 



NTID 



NT AA „ ^ ^ 
— , - — , Score Probability 
AAID Length Length x - 



3.24.5.3.a:L..ca...3.ia i 14100 



tsi — 1 



TIT 



Protein name 



Locus Name 



sp:YDGM_ECOLI 



Acc# 



P77223 



Description 

PUTATIVE FERREDOX IN- L I KE PROTEIN IN ADD -NTH INTERGEMIC REGION 



ORF Name 



NTID 



NT AA 

— ' — - t Score Probability 
AAID Length Length -— * L 



\l$£A2».cl..:i21 1 FTETT 



WTZT 



i$e 1 mi 



6.7e-84 



Protein name 



Locus Name 



cLTDP-6-deoxy-D-glucose-3 , 5 epimerase 



gp:AF048749 



ACC# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence. 



1075 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4069800_c2_246 


T102 


9524 


81 


245 


100 


A . 2&~ Ub 


Protein name 








Locus 


Name 


ACC# 










sp : RPCJ_ 


BPPH1 




Description 














IMMUNITY &WJ=>kli^ok 


PROTEIN 










l 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


42406:aJ:J_l6S 


4105 


9325 


180 


545 


178 


- 4.7e-i3 



Protein name 



Locus Name 



alanine--tkNA ligase, 



] [pir:ii!722lS" 



Description 



Acc# 



E72216 



NT 



— Score Probability 



ORF Name 



NTID 



AAID Length Length 



6.3e-144 



Protein name 

glucose-l-phosphate tnymxdyl transterase 



Locus Name 



gp:Al? l 04BV4y 



Acc# 



AF048749 



Description 

Sacteroides Iragilis capsular poly saccharide biosynthesis operon, complete 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4iS.5.0.aa...cl...i^i 


....... 4105 


5327 


364 


1095 


864 


2.4e-86 


Protein name 








Locus 


Name 


Acc# 










sp : GALE 


_BACSU 


P55180 


Description 















GALACTOUE 4 - UP IMUkAtW ) 



1076 



NT 



AA 



ORF Name 



NTID 



4507781 £2 73 



AAID Length Length 



Score Probability 
\TT2 



6.6e-29 



Protein name 



Locus Name 



|sp:YABH_BACSU 



Acc# 



P37550 



Description 

HYPOTHETICAL 3i.7 KI> PftCTSlN ±M ££PF-Pm& XmV&GmtlC ftfiGlON (OftPl) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



9325? 



TTZT 



Score Probability 
TFS 



|2.2e-0§ 



Protein name 



Locus Name 



cell wall -binding protein nomolog yocH 



pir ;E69901 



Acc# 
E69901 



Description 



NT 



AA 



ORF Name 



NTID 



4&7.6.&.8.2....a3....3.2.Q I 14108 



AAID Length Length 

tjzv — 



W5T 



Score Probability 
986 



Protein name 



Locus Name 



hypothetical protein TM0244 



pir :E72398 



Acc# 



E72398 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



14105 



Length Length 



Score Probability 



12 72 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



1077 



NT 



AA 



ORF Name 



NTID 



|6<W7187 cl 220 



wrnr 



AAID Length Length 

wm — 



Score Probability 
TT1 ' 



3 . 3e-18 



Protein name 



Description 



Locus Name 



sp:YDOT_ECOLI 



Acc# 



P77285 



HYPOTHETICAL Si . S KB ft&OTEIN lit ADD-NTH INTEftflflNlC REGION 



ORF Name 



NTID 



NT AA 

— , „ — , Score Probability 
AAID Length Length JL - 



733125 ±3 157 



T2T" 



0 . 00083 



Protein name 



Locus Name 



GRF MSV198 MTG motxt gene tamily protein 



Acc# 



AF063866 



Description 



Melanopius sanguinlpes entomopoxvirus , complete genome. 



ORF Name 



NTID 



NT AA 
_ i — ■ T — _ Score Probability 
AAID Length Length J - 



14112 



TOT" 



Protein name 



Locus Name 



hypothetical protein aq_12 73 



pir:C70410 



Acc# 



C70410 



Description 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



4113 



„ — , — , Score Probability 
Length Length — — 



Locus Name 



Acc# 



Description 
MO-HIT 



1078 



NT 



AA 



ORF Name 



NTID 



5865837 c2 326 



AAID Length Length 




\T7T 



Score Probability 

Tn — 



1 . Be-19 



Protein name 



Locus Name 



unknown 



gp:AP048V4y 



Acc# 



AF048749 



Description 



BacteroicLes tragiiis capsular polysaccharide JDiosynthesxs operon, complete 
sequence. 



NT 



AA 



ORF Name 



NTID AAID Length Length 

~§m — 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T2I 



Score Probability 
92 " 



0. 00016 



Protein name 



Description 



Locus Name 



Acc# 



gp:D90715 



Escherichia coli genomic DNA. (17 . 6 - 18.0 mm) . 



ORF Name 



NTID 



AAID 



NT AA 

— ^ — . Score Probability 
Length Length 



1II3.15.2lS....c2,..5.2 



ffTTT 



9T3T" 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1079 



NT 



AA 



ORF Name 



NT ID 



AAID 



11133437 ±1 3 



9340 



Length Length 
— 



Score Probability 

m$ — 



<=>.ae-42 



Protein name 



Locus Name 



receptor antigen (RagA; 



gp:P<3I130872 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



11715042 il 1 



4119 



AAID Length Length 




Score Probability 
T£TF7 — 



4.5e-16S 



Protein name 



Locus Name 



probable polyribonucleotide 
nucleotidyltransferase (pnp) 



pir :C71269 



Acc# 



C71269 



Description 



NT 



AA 



ORF Name 



NTID 



13.&££.aft3...±3...,3.i 



AAID Length Length 
"5T%2 



TETT 



Score Probability 
T33 



l2.9e-05 



Protein name 



Locus Name 



ceil surface antigen-like protein A29L 



pir:Tl7519 



Acc# 



T17519 



Description 



ORF Name 



NTID 



AAID 



|Mg.5.0.2.7..7.^il^.7. | [4121 | p^T 



Protein name 

Description 
NO -HIT 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



1080 



NT 



AA 



ORF Name 



NT ID 



AAID 



26364040 ti fe 



wnr 



Length Length 




Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



AccJ 



NT 



AA 



ORF Name 



NT ID 



lia&7.fiaj...±i...2& I mzz 



AAID Length Length 
Witt 



\JTT 



Score Probability 
0.00025 



Protein name 



Locus Name 



transmembrane sensor 



gp:AF05169i 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress t actor A (pstA) , ECF sigma tactor (tiul) , 
transmembrane sensor (f iuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds. 



Si?, 
jt, i 



NT 



AA 



ORF Name 



NTID 



AAID 



iAi&4fiia...Ga...£& i [4T74 



Length Length 
[T7T" 



TTTT 



Score Probability 




5.4e-$S 



Protein name 



Locus Name 



butyrate kinase 



gp:AB016775 



Acc# 



AB016775 



Descri ption 



Clostridium pertnngens DNA tor butyrate kinase and hydrogenase, complete 
cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



\±10.Z2.6.1.±1..±1 1 \%TZ5 



-WITT 



Length Length 
T5T 



F7TT 



Score Probability 
1.5e-15 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-like protein 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ri£>osomal protein S16 -likeprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



1081 



ORF Name 



|430b337 tl 4 



Protein name 

Description 
NO-HIT 



NT 



NTID 



AAID 



Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO -HIT : 



NT 



NTID 



AAID 



TTTT 



9349 



Length Length 



AA 

— , -u Score Probability 



TSTT 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
TTTT 



Score Probability 



Locus Name 



AccJ 



ORF Name 



6A3.B.ia7....ca,...^.S. 



Protein name 



NT 



AA 



NTID 



wnr 



AAID Length Length 
^1 — 



Score Probability 
3 . 2e-45 



Locus Name 



PTB CLOAE 



Acc# 
Q05624 



Description 

PHOSPHATE h LIT YRYLTRANS FERA^ R , ( PUOSPHOTRANS'BU'l'y kVLASK ) 



1082 



ORF Name 



Protein name 

Description 
(MO-HIT 



NT 



AA 



NTID 



AAID 



14130 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT 



NT AA 

„m -r — o_t_ t — Score Probability 
NTIP AAID Length Length £ - 



4131 



TOT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NT ID 



14132 



AAID Length Length 

— 



73T 



Score Probability 
W52 



Locus Name 



115lt outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Description 



1.4e-41 



Acc# 
JC6027 



ORF Name 



Protein name 



NT ID 



NT AA 
_ — ^, T — ^, Score Probability 
AAID Length Length ^ 



maturase-like protein 



Description 



7T 



0.0021 



Locus Name 



gpiCPESPSBC 



Acc# 



AJ222583 



Euglena spirogyra chloroplast partial psbC gene & complete internalmat2 
gene . 



1083 



ORF Name 



NT ID 



AAID 



NT AA 

— ^ — • S core Probab ility 
Length Length 



24736386 c3 3 



TB5" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



142A151ft...cl...fia I PT33* 



7T 



TIB - 



|4.be-07 



Protein name 



Locus Name 



iron (II J transport protein A 



tpir:C72423 



Acc# 



C72423 



Description 



NT 



AA 



ORF Name 



NTID 



1£&17.R17...±3....3.7. I WHZ 



AAID Length Length 



TT7T" 



Score Probability 

ists — 



2.0e-91 



Protein name 



Locus Name 



sp:PBPC_ECOLI. 



Acc# 



P76577 



Description 

AFUNCTIONAL PENICILLIN-BINDING PROTEIN lC PREffllftSOR (MP-1C) 



ORF Name 



NTID 



AAID 



NT AA 

— , — L1 Score Probability 
Length Length — ~ ^ 



ia&a2ia&..xi.;.a i 14137 



1325' 



l.le-43 



Protein name 



Locus Name 



cell cycle protein homolog mesJ 



pir :T31465 



Acc# 



T31465 



Description 



ORF Name 



NTID 



2AQfc.:7.6.5a...t:L..;L5 1 14138 



Protein name 



vsrD protein 



Description 



NT 



AA 



AAID Length Length 
Tim 



Score Probability 
2 . 7e-09 



TT7 



Locus Name 



pir :I40540 



Acc# 



140540 



1084 



NT 



AA 



ORF Name 



NT ID 



AAID 



41U03882 f 2 2$ 



TTTT 



Length Length 
PT5TT 



Score Probability 




4.5e-52 



Protein name 



Locus Name 



conserved hypothetical protein 



pir:H72370 



Acc# 
H72370 



Description 



NT 



AA 



ORF Name 



T — _* — ^ Score Probability 
NTID AAID Length Length JL 

ST5*S 



14140 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



a2a2L&iaa...ci...fi.4 1 ft?t 



NTID AAID Length Length 



Score Probability 
T7I — : 



1.7e-40 



Protein name 



Locus Name 



sp : FEOB_MET JA 



ACC# 



Q57986 



Description 

FERROUS IRON TRANSPORT £R0TEIM B HOM0LOG 



NT 



AA 



ORF Name 



NTID 



AAID 



imeMh7...±i...iA I irnr 



Length Length 



Score Probability 



Protein name 

Description 
PSTO-MXT — — 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



|3.AIB.3.43.7...±2...2.a I pm 



AAID Length Length 
— 



Score Probability 
751 



|2.3e-74 



Protein name 



Locus Name 



Na+/H+ antiporter homolog yheL 



pir :D69829 



Acc# 



D69829 



Description 



1085 



ORF Name 



NTID 



NT AA ^ „ , ^ . n . ^ 
r — ^ T — _ S core Probabil ity 
AAID Length Length JL 



14376056 cl 61 



4144 



ST 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



14145 



Length Length 



1185 



Score Probability 




|2.1e-5:5 



Protein name 



Locus Name 



antibiotic resistance protein homolog ydeR 



pir :D6977y 



Acc# 



D69779 



Description 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length - 



6Aa3.&2...tl..A.. 



TF7TT 



15613 



644 



Probability 
|1.5e-60 



Protein name 



Locus Name 



hypothetical protein JD2520 



pir :G6bQ^8 



Acc# 



G65028 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Ift6.fm3.5...±1...6. I FIT7 



Length Length 
T5T 



Score Probability 
T&l 



|6.,0e-iS 



Protein name 



Locus Name 



hypothetical protein CT276 



bir:A71535 



Acc# 



A71535 



Description 



ORF Name 



NTID 



117.y.:/.b.:/....tl....7. I 14148 



Protein name 

Description 
(NO-HIT 



AAID 



NT 



AA 



Length Length 
WTT 



Score Probability 



Locus Name 



Acc# 



1086 



ORF Name 



1251313 ci 26 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



9371 



Length Length 



11151 



Score Probability 
|6.5e-125 



Locus Name 



Acc# 



Q46127 



ORF Name 



20975(>5i ±1 1 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — ^ 



WT7T 



HT9 — i imru 



33B" 



$.6e-37 



Locus Name 



immunoreactive 42RD antigen PG33 



|gp:AF175715 



Acc# 



AF175715 



Description 



Porphyromonas gingival is strain W50 
complete cds . 



immunoreactive 42KD antigenPG33 gene, 



ORF Name 



NTID 



AAID 



— , — , Score Probability 
Length Length 



2.3.5.9.^2.53....Cl...ZZ.. 



£2 I [T5^ 



Protein name 



Locus Name 



Acc# 



Description 
(NO -HIT 



ORF Name 



NTID 



\ua5A0.i:l±i..ib. i m^i 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



1087 



ORF Name 



NT ID 



— , — , Score Probabxlxty 
AAID Length Length JL 



25516062 12 10 



5T73~ 



i.le-14 



Protein name 



Locus Name 



"immunogenic 7 5 kDa protein PG4 



gp:AF14580U 



ACC# 
AF145800 



Description 



Porphyromonas gingival is strain W5 0 immunogenic 75 kDa protein PG4gene, 



complete cds . 



NT 



AA 



ORF Name 



NT ID 



30210933 13 13 



4154 



AAID Length Length 
555 



5T73' 



T5T 



Score Probability 



4.2e-07 



Protein name 



Locus Name 



transposase 



gp:AF036§6£ 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn5b2 0 transposase (JoipH) anctmoJoiiization 
protein BmpH (bmpH) genes, complete cds. 



ORF Name 



NTID 



AAID 



NT AA 
T — T — Score Probability 
Length Length 



5T7T 



[CTT5" 



Protein name 



Locus Name 



Acc# 



Description 
INO-HTT 



ORF Name 



NTID 



&7.1&23.6....£3....:L2. I 



Protein name 



Description 



NT AA 
T — , u T — _ Score Probability 
AAID Length Length 



5T7B- 



1081 I 132"^ 



TOT 



Locus Name 



sp:PYRi__DICDI 



Acc# 



P20054 



1088 



NT 



AA 



ORF Name 



NT ID 



7140552 c3 SO 



AAID Length Length 
9379 



Score Probability 
0.032 



37 



Protein name 



Locus Name 



EntT 



gp:AP09S0tltt 



ACC# 



AF099088 



Description 



Enterococcus faecium enterocin A (entA) , Entl (entl) , EntF (entF) , EntK 
entK) , EntR (entR) , bacteriocin- like protein, EntT (entT) , EntD (entD) , and 
protease IV homolog genes, complete cds; andunknown genes. 



ORF Name 



10741300 £3 30 



Protein name 



NTID 



AAID 



NT AA 

— , — „ Score Probability 
Length Length 



Locus Name 



Acc# 



Description 
ItJO-HIT 



ORF Name 



12.l0.9.10...±2L..l&.. 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



proline dipeptidase 



Description 



TT7TT 



Locus Name 



pir :D75419 



S.5e-23 



Acc# 



D75419 



ORF Name 



Protein name 



NTID 



AAID 



WIST 



NT 



Length Length 



AA 

— ' , Score Probability 



TUT 



Locus Name 



Acc# 



Description 
f^FITTT 



1089 



ORF Name 



NT ID 



AAID 



NT AA 

— ^ T — ^ Score Probabil ity 
Length Length aL 



9383 



T7T 



|4,6e-243 



Protein name 



Description 



Locus Name 



|sp:DHE4__BACFR 



Acc# 



P94316 



(MAI)(P)H-fifif>fiU£)EKft GLUTAMATfi DEHYD&O^ENA^E) 



ORF Name 



24251553 c2 40 



Protein name 



NT 



AA 



NT ID AAID Length Length 

Wizi 1 mm I I YTTT 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT AA 

— ^ — , Score Probability 
NTID AAID Length Length ; ; *~ 



«85 — I nui — I FDT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Z^B.9.5.U3.7....CZ...3.9. 



Protein name 



NT 



AA 



NTID 



AAID 



14154 



Length Length 



Score Probability 




Locus Name 



probable phosphoenolpyruvate synthase APE 002 6 



pxr :E727b4 



Description 



8 . 9e-06 



Acc# 



E72754 



1090 



ORF Name 



Protein name 



Description 



W 



NT 



AA 



NTID 



AAID 



4165 



Length Length 



Score Probab ility 

rn 



1.2e-05 



Locus Name 



sp : PPCE HUMAN 



Acc# 
P48147 



ORF Name 



3959800 C3 48 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



4166 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT ■ 



NT 



AA 



NTID 



AAID 



4167 



Length Length 
TM 1 [TOd — 



Score Probability 



Locus Name 



Acc# 



ORF Name 



2.Q.5.XQ.Q.5.2...X3...3.3... 



Protein name 

Description 
NO-HIT : 



NTID 



AAID 



NT AA 

— , — , . Score Probability 
Length Length 



1 \nrz 



Locus Name 



Acc# 



1091 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 

vjwi — 



TIT 



11011 



Score Probability 
TT7 



10.00030 



Protein name 



Locus Name 



transmembrane sensor 



(gp:AF0516£l 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress factor A (pstAj , ECF sigma f actor (fiul) , 
transmembrane sensor (fiuR) , and hydroxamate^typef errisiderophore receptor 
(fiuA) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



124354562 c3 66 



Length Length 
5T5~ 



Score Probability 



1548 



Protein name 

Description 
INO-HI'P 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



WTTT 



AAID Length Length 
UTTS — 



Score Probability 
TEE 



3.2e-07 



Protein name 



Locus Name 



unknown 



gp:U96771 



ACC# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase, B-l, 4- endoglucanase, and 
mannanase genes, complete cds / and unknowngenes . 



ORF Name 



NTID 



AAID 



NT AA 

— . , — ^ Score Probability 
Length Length ^ 



\2lll&M.6....cl..A&. I I3T72 



9394 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



1092 



ORF Name 



NTID 



NT AA _ - . , • x- 
_ — _ — _ Score Probability 
AAID Length Length 



c2 54 



ffT7T 



Protein name 



115K outer memJorane protein precursor : SusC 
protein 



Description 



Locus Name 



pir : JC6U2 7 



z . ye-44 



Acc# 



JC6027 



ORF Name 



NTID 



NT AA 

~. _ __ T — T — ^. Score Probability 
AAID Length Length * L - 



l 2M£2512..±I...12 1 



4 . be-14 



Protein name 



Locus Name 



RNA polymerase sigma factor SigZ-iike protein 



|gp;AF13725T 



Acc# 



AF137263 



Description 



Eacteroides tlietaiotaomicron 3 OS ribosomal protein si6-likeprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3.4a.7.ma&...c2...5.i 1 14175 



193 9 7 



TT5T 



TTTT 



li.2e-07 



Protein name 



Locus Name 



unknown 



igp:TO^77i 



ACG# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase , B-l , 4-enctoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID AAID Length Length 

-^m — 



1128 I 13387 



Score Probability 
55T 



2 . le-86 



Protein name 



Locus Name 



receptor antigen (RagAJ 



bp: 1X41130*1 7 2 



Acc# 



AJ130872 



Description 



Porphyromonas gingivalis W50 receptor antigen (rag) locus encoctinga major 
immunodominant 55kDa antigen. 



1093 



ORF Name 



3929183 cl 47 



Protein name 

Description 
MO-HIT 



NTID 



NT AA 
T — T — Score Probability 
AAID Length Length JL 



WTTT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



8.3.15.5.D...,Gl...46. I 14178 



AAID Length Length 



Score Probability 
EFS — 



Protein name 



Locus Name 



115K outer membrane protein precursor : Sus(J 
protein 



Description 



pir : JC6027 



F 



le-56 



Acc# 



JC6027 



i! !"s 
iiil ;!>;;: 

FLI 



0 

f "5 

; ir; pi? 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length i - 



S.3.15.5.0..„c3....£l I 



1140 1 13423 



FDT" 



5.2e-S8 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130^72 



; Acc# 



AJ130872 



Description 



Porphyromonas gingival is W5 0 
immunodominant 55kDa antigen. 



receptor antigen (rag J locus encodinga major 



NT 



AA 



ORF Name 



NTID 



£5.0.a3.7....al...4£ I WT&U 



AAID Length Length 
?%U2 



TT7T 



Score Probability 
0.00017 



TIT 



Protein name 



Locus Name 



outer membrane protein 



gp:BS&<M>B 



Acc# 



L77614 



Description 



Bacteroides thetaiotaomicron 
cds . 



outer membrane protein (susD) gene, complete 



1094 



NT 



AA 



ORF Name 



NTID 



14181 



AAID Length Length 




Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



IiM3ia.7...±l..fi£ I 



AAID Length Length 
MM 



Score Probability 



Protein name 

Description 
WO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



i3.as.ao.D.3....ti...is I fts? 



NTID AAID Length Length 

9405 



Score Probability 
— 



|1.4e-17 



Protein name 



Locus Name 



unknown 



gp:AF04S749 



Acc# 



AF048749 



Description 



Bacteroides rragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



x^^.v.aaao....a2L...iQ2 | WTM 



NTID AAID Length Length 



Score Probability 



Protein name 

Description 
ETCPETT 



Locus Name 



Acc# 



1095 



NT 



AA 



ORF Name 



NTID 



AAID 



15103S00 tl 19 



Length Length 
S3 - 



Score Probability 
TT7T5TB 



7F 



Protein name 



Locus Name 



response regulator 



gprAFl^O^'T 



Acc# 
AF130997 



Description 



Enterococcus taecium strain BM43 3y vanD giycopeptide resistancegene 
cluster, complete sequence. 



ORF Name 



NTID 



NT AA 

— ■ — , Score Probability 
AAID Length Length 



15822S33 £2 2l 



75" 



T5F 



5.1e-iS 



Protein name 



Locus Name 



conserved nypotnetical protein yisg 



pir:H6S83? 



Acc# 
H69837 



Description 



ORF Name 



NTID 



Il£^.lfm...c2:...120. I 



Protein name 



AAID 



NT AA 

— „ — , Score Probability 
Length Length 



55" 



Locus Name 



Acc# 



Description 

pro^Enr 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — 



16.6.0.1A1Z..LZ.A0. I fflTO 



1.2e-187 



Protein name 



Locus Name 



UDP - ManNAc denydrogenase 



gp:AP125164 



Acc# 
AF125164 



Description 



Bacteroides tragilis 638R polysaccharide B (PS B2) biosynthesis locus , 
complete sequence; and unknown genes. 



1096 



ORF Name 



NT ID 



16832885 c2 101 



Protein name 



hypothetical protein 



Description 



NT 



AA 



T — ^, T — ^ Score Probabi lity 
AAID Length Length ^ 



FOX" 



T7TT 



Locus Name 



pir : JQ1020 



^.3e-177 



Acc# 
JQ1020 



ORF Name 



NT ID 



2fl.7.fi2ft£l2L-.±1...5a I BTTO 



Protein name 



NT AA 

, T — T — Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



Z16..7.ZlB.l...a3....13.7. ...I 14 1 9 1 



Protein name 



NT 



AA 



AAID Length Length 
^£T3 



Score Probability 



WTT 



Locus Name 



Acc# 



Description 
IHO-HIT 



ORF Name 



Protein name 



unknown 



Description 



NTID 



NT AA 

^ ^ _ — ■ — _ Score Probability 
AAID Length Length ' L ~ 



T7T~ 



i3.2e-13- 



Locus Name 



|gp:AF048749 



Acc# 



AF048749 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . . 



1097 



ORF Name 



NT ID 



NT AA 
_ — T — Score Probability 
AAID Length Length 



ci 94 



9415 



Protein name 



long- chain- tatty-acid CoA ligase 



Description 



1698 



Locus Name 



pir :D70386 



7.9e-58 



Acc# 



D70386 



NT 



AA 



ORF Name 



NT ID 



21£.S.BA5.0...±2..A6. I WTM 



AAID Length Length 
ME — 



Score Probability 
|2.7e-10 



Protein name 



Locus Name 



arylsultotransterase 



gp:AF126201 



Acc# 



AF126201 



Description 



Pseudomonas putida straxn s-313 sultate ester desulrurization genelocus, 
complete sequence . 



NT 



AA 



ORF Name 



NTID AAID Length Length 




Score Probability 



TUT" 



TTT 



Protein name 

Description 
MO-HM 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



22&£A±2&...al...±2Z. I I5T^ 



9418 



Length Length 



Score Probability 

m 



'0. 031 



Protein name 



Locus Name 



sp:S&fcC_XEMIA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (ON) {BASEMENT MEMBRANE PkOTEIN BM-40) 



1098 



# 



NT 



AA 



ORF Name 



'22^600 ±1 17 



NT ID AAID Length Length 

— 



Score Probab ility 



MB" 



10.0024 



Protein name 



Locus Name 



retinoid X receptor alpha homolog 



gp:UPU3i*m 



Acc# 
U31832 



Description 



Uca pugilator retinoid X receptor alpha homolog mRNA, DNA bindingdorriain 
region, partial cds. 



ORF Name 



NTID 



NT AA 

, „ — ^ — , Score Probability 
AAID Length Length — JL 



24035212 c2 HO 



OF" 



Protein name 

Description 
1N0-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



2.&l£210.0....cl...!&0. I 



NTID AAID Length Length 

I9I3T — 



Score Probability 
SB? 



!6.$e-86 



Protein name 



Locus Name 



GDP-L-tucose pathway enzyme 



gp:AB008676 



Acc# 



AB008676 



Description 



Escherichia coli 0157 DNA, map position at 46 min. , complete cds. 



NT 



AA 



ORF Name 



\2&8M.b.0±...a'L^.b. I 



NTID AAID Length Length 

^TT2 — 



11161 



Score Probability 
T53 



9 . 9e-08 



Protein name 



Locus Name 



probable PPE protein 



pir :D70604 



Acc# 



D70604 



Description 



1099 



NT 



AA 



ORF Name 



2b47261 t2 37 



NT ID 
FOTJI 



AAID 



Length Length 
El 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



2.5.9.8.8A2....£l...ia | 14202' 



NTID AAID Length Length 
fM23 



Score Probability 




0.041 



Protein name 



Description 



Locus Name 



gp:F23A5 



Acc# 



AC011713 



Arabidopsls thaliana chromosome 1 BAC F2 3A5 sequence, completesequence . 



NT 



AA 



ORF Name 



NTID 



p-AMSs^ta^.?. I prjr 



AAID Length Length 
— 



Score Probability 



T5T 



Protein name 
Description 

NO-HI* 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
' — , — , Score Probability 
Length Length 



\l$£.te&ll.±l..A± I [4^04 



MIT 



IT7T 



3.5e-165 



Protein name 



Locus Name 



UDP-GlcNAc 2-epimerase 



gp:AF125164 



Acc# 



AF125164 



Description 



Bacteroides tragiiis fcibR polysaccharide B (PS B2) biosyntnes is locus , 
complete sequence ,* and unknown genes . 



1100 



# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



31444127 51 



4205 



TOT 



5 . 3e-162 



Protein name 



Locus Name 



tructose-bisphosphatase, 



Acc# 



C69621 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



iai4ftj.&..±3L.xi 



Length Length 
I3T" 



Score Probability 
125 



Protein name 



Locus Name 



probable Iipopolysaccharxde O-sicte chain 
biosynthesis protein (O-antigen transpoter) 



Description 



bir:F71152 



3.9e-07 



Acc# 



F71152 



W 

in 

law 

'!!} !!!|'.' 

fij 



ORF Name 



Protein name 



Description 



INO-HIT 



NTID 



AAID 



NT AA 
, — , „ — , Score Probability 
Length Length sL ~ 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



— , — J , Score Probability 
Length Length — 



210 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



ia5.1i3..7.5.3....al...7..7. I 



Length Length 



Score Probability 



ST 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1101 



NT 



AA 



ORF Name 



NTID 



AMD 



36134625 t2 36" 



Length Length 



Score Probability 
6 , Oe-138 



11351 



Protein name 



Locus Name 



glucose- l-pnospnate thymidyl transterase 



gp:Ay04aV4y 



Acc# 



AF048749 



Description 



Bacteroides tragi! is capsular polysaccharide biosynthesis operon, complete 
sequence. 



NT 



AA 



ORF Name 



'4455002 i3 50 



NTID AAID Length Length 

5¥T3 — 



55S — I irm 



Score Probability 

— 



5.6e-56 



Protein name 



Locus Name 



Acc# 



'spiYlDEJslCOLl 



Description 

HYPO T H ETI CAL 55.9 KB PROTEIN IN GLVC -IBPB INTER5ENIC REGION (ORFA) 



U 

U jiiii! 



NT 



AA 



ORF Name 



NTID 



WITT 



AAID Length Length 

Tim 1 



TO" 



Score Probability 
|6.0e-Ii5 



Protein name 



Locus Name 



CapSE 



gp:SAU7JJ74 



Acc# 



U73374 



Description 



Staphylococcus aureus type 8 capsule genes, cap8A, cap8B, cap8C,cap8D, 
cap8E, cap8F, cap8G, capSH, cap8I, cap8J, cap8K, cap8L,cap8M, cap8N, cap80, 
cap8P, complete cds * 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



imfim..±i...&a 



VZTT 



7WT 



li.Oe-76 



Protein name 



Locus Name 



dTDP-6-deoxy-D-glucose-3 , 5 epimerase 



gp:AF04S749 



Acc# 



AF048749 



Description 



Bacteroides rragilis capsular polysaccharide biosyntnesis operon, complete 
sequence . 



1102 



NT 



AA 



ORF Name 



NT ID 



4541937 c2 112 



AAID Length Length 




T7T" 



Score Probability 
|S.9e-i24 



I2T5T 



Protein name 



Locus Name 



Acc# 



GDP-mannose dehydratase 



gp:AF04 74 7 8 



Description 



Brucella melitensis strain 16M iipopoiysaccharide O side chambiosynthesis 
gene cluster, complete sequence. 



ORF Name 



NT ID 



5115927 12 42 



Protein name 



pleiotropic regulatory protein DegT 



Descri ption 



NT 



AA 



AAID Length Length 
9437 



Iff 



1095 



Score Probability 




Locus Name 



pir :D69025 



|4.1e-45 



ACC# 



D69025 



ORF Name 



Protein name 



NT 



AA 



NTID 



I5.s.7.ai.7.s..±3....6.a i wnz 



AAID Length Length 
— 



Score Probability 

iz — ~ 



Locus Name 



reverse transcriptase like protein l, 
intron- encoded 



pir :S58503 



Description 



0.04S 



Acc# 



S58503 



ORF Name 



NTID 



NT AA 
T — ^, T — ^, Score Probability 
AAID Length Length — 



\10£21B.D...±2...A& 



4217 



1355" 



ITJ77" 



Protein name 



Locus Name 



Acc# 



aspartate aminotransferase (aspb-likel) 
PAB0774 



pir :D75096 



D75096 



Description 



1103 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7314161!) tl 11 



TUZT 



|6.5e-38 



Protein name 



Description 



Locus Name 



'sp:YA38_HAl!!lW 



Acc# 



P44099 



HYPOTHETICAL ^kOl'UlN kliuiB 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



WfO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



W7T 



7T3~ 



7.2e-71 



Locus Name 



probable oxiaoreauctase 



IgpT^UFTT* 



Acc# 



AL132662 



Description 

Streptomyces coelicoior cosmxd b'll. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



14221 



T7T 



3.3e-^4 



Protein name 



Locus Name 



shikimate b-aenyclrogenase 



bir:F70i77 



ACC# 



F70377 



Description 



1104 



NT 



AA 



ORF Name 



NT ID 



AAID 



19444 



Length Length 



Score Probability 
77Z 



3 ,6e-30 



Protein name 



Locus Name 



conserved .hypothetical protein 



pir :G72409 



Acc# 



G72409 



Description 



ORF Name 



NT ID 



AAID 



12b.0.:/.^9.1...c2...217. I 



Protein name 



lemA protein 



Description 



NT AA 

— — Score Pr obab ility 
Length Length 



[27T" 



Locus Name 



pir :F72311 



2 . Oe-45 



Acc# 



F72311 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



ii£ianai...c£...23.a I wzrz 



2 . 9e-46 



Protein name 



Description 



Locus Name 



sp:ALR2_BACS0 



Acc# 



P94494 



PUTATIVE ALANINE RACEMASE, 



NT 



AA 



ORF Name 



NTID 



AAID 



15.ai5.15.6.2....al...lS.l I 142^5 



Length Length 
315 



Score Probability 



Protein name 

Description 
ISO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



15A5..7.U5.2...G2...211 1 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT ~ 



Locus Name 



Acc# 



1105 



NT 



AA 



ORF Name 



NT ID 



AAID 



15829675 12 57 



TZTT 



Length Length 



Score Probability 
TT? 



Protein name 



Locus Name 



receptor antigen (RagAj 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivalis W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



16131877 c2 231 



NTID AAID Length Length 
MSu — 



TTT 



TTT 



Score Probability 
TTZ 



1.7e-31 



Protein name 



Locus Name 



conserved hypothetical integral membrane 
protein HP1061 



pir :E64652 



Acc# 



E64652 



Description 



NT 



AA 



ORF Name 



NTID 



20MMA2...alJlti.b. J WITS 



AAID Length Length 
TO 



1043 I pra 



Score Probability 
1.2e-194 



TUZT 



Protein name 



Locus Name 



beta-galactosictase 



|pir:F72283 



Acc# 



F72283 



Description 



ORF Name 



NTID 



a.a.7.io.a.8.8...±i...i3.a i wttu 



Protein name 



glutamine-asparagme ricn protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



Locus Name 



gp:DDU07ai7 



0.043 



Acc# 



U07817 



Dictyostelium discoideum AX 3 glutamine-asparagme rich proteingene, partial 
cds . 



NT 



AA 



ORF Name 



22147552 c2 232 



NT ID AAID Length Length 




397 



TTPT" 



Score Probability 




Protein name 



Locus Name 



3 -O-acyl transferase , MolmB :micLecamycin 
biosynthesis enzyme 



pir :A42719 



Acc# 



A42719 



Description 



ORF Name 



Protein name 



NT ID 



NT AA 
, — ' , — ^ Score Probability 
AAID Length Length • L 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



Description 



NT ID 



NT AA 

— , , — , Score Probability 
AAID Length Length ~ — ; ^ 



WIJT 



7W 



1.4e-51 



Locus Name 



Acc# 



sp:LPXA_JiiCOLI 



(EC 2.3.1.125) ( tJDP - M - AC ET YLGLUCO S AM XNE ACYLTRAKfSFEftASE) 



ORF Name 



NT ID 



NT AA 

„ TT ^ T — _ T — _ Score Probability 
AAID Length Length 



;mim^c2J&itt | piT" 

Protein name 



TT7TT 



TTT 



li.le-23 



Locus Name 



sp:PUR5_METJA 



Acc# 



Q57656 



Description 

(AIRS) ( PHOS PHORI BOS YL - AMINO IMi DA^OLE SYNTHETASE) (AIR SYNTHASE) 



1107 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length JL 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



l±&hll02...cl.„212 I 



AAID Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



z^a3.a5..7..7....az...ziy... 



Length Length 



TTIT 



Score Probability 




: 4.9e-S0 



Protein name 



Description 



Locus Name 



sp:RFI__COXBU 



Acc# 



P47849 



PEPTIDE CHAIN RELEASE FACTOR i (RE-1) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2LAliaGi3L...C2...22L.7. I |423§ 



9460 



130 



T3T 



Protein name 

Description 
MO -HIT 



Locus Name 



Acc# 



1108 



NT 



AA 



ORF Name 



NTID 



124223382 c2 206 



AAID Length Length 
TUT? — 



Score Probability 
|9.2e~55 



Protein name 



Locus Name 



hypothetical protein slrisao 



pir:S77134 



Acc# 



S77134 



Description 



ORF Name 



NTID 



NT AA 

_ TT ^ T — _ _ — _ S core Probability 
AAID Length Length ^ 



4240 



9462 



Protein name 



ribosomal protein L2 0 



Description 



IT 



\ZTT 



70 



0.033 



Locus Name 



pir :A75326 



Acc# 



A75326 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ■ x - 



Z433.15.6.1,...a3....Z48. 1 14241 



1 . 2e-36 



Protein name 



Locus Name 



sp:YUAG_W!SU 



Acc# 



032076 



Description 

HYPOTHETICAL 56 . 0 KD PROTEIN IN GLGB-GBSB INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



2iiO.M5.7....c:L.m I 



Length Length 
1130 



Score Probability 
7.0e-55> 



S7S 



Protein name 



Locus Name 



hypothetical protein SI11582 



pir :S7530y 



Acc# 



S75309 



Description 



1109 



ORF Name 



Protein name 



Description 



(EC 3.5.1. «) 



NT 



AA 



NTID 



AAID 



Length Length 



TJWT 



Score Probability 
T&l 



2.7e-44 



Locus Name 



sp:LPXC_HAEIN 



Acc# 



P45070 



ORF Name 



24640752 r2 45 



Protein name 

Description 
KO-HIT 



NT 



AA 



NTID 



4244 



AAID Length Length 



Score Probability 



wrr 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



i4£S15£3L...ci...iia I 



AAID Length Length 
— 



7TT 



Score Probability 
|1.4e-37 



Protein name 



Locus Name 



OMP decarboxyiase-orotate phosphoribosyl 
transferase, 



pir :T30520 



Acc# 



T30520 



Description 



ORF Name 



NTID 



NT AA 

^. — Score Probability 

AAID Length Length ■ JL 



\lAS.9£.10^al^lA9. | p^T" 

Protein name 



uJoiquinone/menaqumone biosyntnesis 
me thy 1 1 r ans f e rase 



Description 



Locus Name 



pir :F75277 



14.78-44 



ACC# 



F75277 



1110 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NT ID 



AAID 



14247 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



24MUi2..±2...44 1 pn^ 



Protein name 



provable giycosyl Hydrolase 



Description 



NT AA 

— — Score Probability 
Length Length J - 



7F" 



0.0023 



Locus Name 



pir :T36"467 



Acc# 



T36467 



i: lis? 
i! H 

in 



W 

n;;i! 

S!f ISIS 

w ffl? 



NT 



AA 



ORF Name 



NTID 



AAID 



Protein name 



Length Length 



1806 



Score Probability 
W7UT? 



91 



Locus Name 



polygalacturonase precursor 



pir:^i>7806 



Acc# 



S57806 



Description 



ORF Name 



NTID 



NT AA „ „ ■ . . _ . ^ 
T — _ T — ■ Score Probability 
AAID Length Length <L - 



9472 



Protein name 

Description 
IN'O-HIT ~ 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



2%l&l6£l..±l..±m I 



NTID AAID Length Length 

15373 — 



TTTT 



Score Probability 




8 . 9e-2i 



Protein name 



Locus Name 



Acc# 



histidme kinase 



|gp:£PAJ'6393 



AJ006393 



Description 

Streptococcus pneumoniae rr03 ana nk03 genes; two component system03 , 



1111 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — J - 



'31520840 r2 



Protein name 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



3.3.23.5. aaa..X2L...5i5i.. 



Protein name 



S¥7S~ 



64 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 
T ~ , T ~I. , Score Probability 
Length Length 



3.3.6.1Z.7.&.^...t2....ai.. 



Protein name 



14254 



STB" 



S57" 



TUT 



Locus Name 



5 . 2e-16 



Acc# 



hypotnetical protein aq_246 



Description 



pir:E70322 



E70322 



NT 



AA 



ORF Name 



NT ID 



AAID 



3.^1.7.3.0,.7.5....CZ...Z1^.. 



14255 



3177^ 



Length Length 
1557 



Score Probability 



1 . 6e-59 



Protein name 



Description 



Locus Name 



sp:PUR7_AkATH 



Acc# 



P38025 



(EC 6.3.2.6) ISA! CAR. SYMTHETAiJli) 



ORF Name 



NTID 



AAID 



NT AA 
T Score Probability 
Length Length 



M5.i6.M^t2 ^3.u | p55 1 ®Z7T 

Protein name 



1200 



7.7e-15 



Locus Name 



conserved hypothetical protein 



pir;C72261 



Acc# 



C72361 



Description 



1112 



ORF Name 



Protein name 



NTID 



NT AA 
_ T — . — • Score Probabi lity 
AAID Length Length J - 



mi 9 



79 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length • x - 



3.5AQ.±8.83....t±..A! I 



94S0 



ITT 



Locus Name 



Acc# 



Description 
MO-HIT 



ii iss? 

■ ■ ^ 

% v$ 

a IS, 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

„ Tn — _ — ^ Score Pro bability 
AAID Length Length — JL 



ITT 



2.0e-2$ 



Locus Name 



sp:YQKD_BACSU 



Acc# 



P54567 



HYPOTHETICAL 24.6 KD PR0TK1N IM IMTERGtlWIC REGION 



ORF Name 



NTID 



NT AA 

~ _ _ v — _ Score Probability 
AAID Length Length — -L 



3.3.4S.3.$.2...c;3„..2L£i I F^TT 



3¥F 



S7T 



6 .3e-56 



Protein name 

Description 
(EC 2.i.±.-). 



Locus Name 



sp:LPXD__RICRI 



ACC# 
P32202 



(FIRA PROTEIN) (R1FAMPIC1N RESISTANCE PROTEIN) 



1113 



ORF Name 



N'T ID 



NT AA 

_ _ _ _ — _ T — Score Probabi l ity 
AAID Length Length JL 



3956556 c2 2 ; 22 



9483 



JUT 



\525 



Protein name 



Locus Name 



tRNA isopentenylpyrophosphate transferase 
miaA 



Description 



|p:Lr:G69657 



Acc# 



G69657 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

wm, — 



7W 



Score Probability 




1.4e-37 



Protein name 



Description 



Locus Name 



sp : TRUA_BACSU 



Acc# 



P70973 



I) (PflEUDOUklDINE SYNTHASE I) (URACIL HYDROLYASE) 



o 

% aw 

W 

:!=* 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



\±lS±S£.2^aljL5.Q t ........ ^ [4263 | 

Protein name 



773" 



i.2e-30 



Locus Name 



conserved Hypothetical protein 



pir:G7231I 



Acc# 



G72311 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



l A2115fi2..±a...ia5.. 



Length Length 
-£51 1 



Score Probability 



Protein name 

Description 
INO-BIT 



Locus Name 



Acc# 



1114 



NT 



AA 



ORF Name 



NTID 



14767252 cl 174 



4265" 



AAID Length Length 

mw? — 



TWIT 



Score Probability 
5 . Ie-70 



V7TU 



Protein name 



Description 



Locus Name 



|gp:BMAJ4829 



Acc# 



AJ224829 



Bacillus megaterium DSM319 spoIV operon, ST 1 Clanking region, 3 ' r lankmg 
region. 



NT 



AA 



ORF Name 



NTID 



5082512 c3 265 



T — T — _ Score Probability 
AAID Length Length — JL 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



£}£Mi0...x:l„.l$.£ I m&T 



NTID AAID Length Length 

9489 



7T 



Score Probability 




3.3e-ll 



Protein name 



Locus Name 



conserved nypotnetical secreted, protein 
HP0320 



pir:H64559 



Acc# 



H64559 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



|7.126.i3.1...aI...lB.B. I 



9490 



Length Length 



Score Probability 
TTTuTD 



72 



Protein name 



Locus Name 



leech zinc tmger protein 



Acc# 



X91396 



Description 

H.triserialis Lztl gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
OF" 



Score Probability 
TF1 



Protein name 



Locus Name 



sp:MECI_£TAEP 



Acc# 



P26598 



Description 

METHICILLIN RESISTANCE REGULATORY PROTEIN MECI 



ORF Name 



NTID 



NT AA 
T — _ — Score Probabilit y 
AAID Length Length ^ 



992787 C2 221 



l.be-47 



Protein name 
Description 

HYPOTHETICAL 51.0 KD PROTEIN IN PTA 3 1 REGION 



Locus Name 



sp:YWFO_fiAcJSU 



Acc# 



P39651 



il tnF 

o 



NT 



AA 



ORF Name 



NTID 



AAID 



¥T7T" 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



4272 



TOT 



Length Length 




174 



Score Probability 
" 



Protein name 

Description 
ROD SHAPE - DETERMINING PROTEIN kODA 



Locus Name 



Acc# 



sp:R0DA__EC0LI 



1116 



NT 



AA 



ORF Name 



NTID 



AAID 



14273 



9495 



Length Length 
TT5" 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TuTT 



TUT 



Score Probability 
TUB 



6 . 6e-06 



Protein name 



Locus Name 



hypothetical protein PH0217 



pir:G71244 



Acc# 



G71244 



Description 



NT 



AA 



ORF Name 



NTID 



2iaA45fl&...c2L...3L& 



AAID Length Length 
— 



ST" 



T5T 



Score Probability 
TulT 



T72e"=W 



Protein name 



Locus Name 



hypothetical protein PH0219 



pir:A7124b 



Acc# 



A71245 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



24ii.7.D.6.1...cl...^i I [4T7^ 



Length Length 



TuW 



Score Probability 



5.3e.-36 



Protein name 



Description 



Locus Name 



sp:METF_AOUAE 



Acc# 



067422 



5 , iO-M E THVLENBTETJ^AHVDROFQIATM REDUC T ASE , 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



UTT" 



Protein name 

Description 
IHO-HIT 



Locus Name 



Acc# 



1117 



NT 



AA 



ORF Name 



NTID 



14^78 



AAID Length Length 

mvo — 



Score Probability 
72 



0.034" 



Protein name 



Locus Name 



hypothetical protexn PH0220 



pxr:B7124S 



Acc# 



B71245 



Description 



NT 



AA 



ORF Name 



NTID 



5.1$££.l...a2..A! I 



AAID Length Length 
3"S3T — 



Score Probability 
|4.be-3S 



418 



Protein name 



Description 



Locus Name 



sp:YAAT BACSU 



Acc# 



P37541 



HYPOTHETICAL 31.2 KD PROTEIN IN XPAC-ABRB IMTERGEMIC kEGION 



NT 



AA 



ORF Name 



NTID AAID Length Length 

— 



T7TT 



TTTT 



Score Probability 
TF5 



Protein name 



Locus Name 



DNA polymerase III gamma subunit 



E 



H :A70460 



Acc# 



A70460 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



lD.8.3.8.a&5....al...2S.. 



Length Length 

ti 1 1 



Score Probability 



Protein name 

Description 
MO-HIT — 



Locus Name 



Acc# 



1118 



• 



NT 



AA 



ORF Name 



NTID 



AAID 



11197328 ±1 1 



Length Length 



Score Probability 




|4.be-112 



Protein name 



Description 



Locus Name 



sp : BGLS__AGRTU 



ACC# 



P27034 



GLUCOSlfifi GLUCOHYbROLASE) 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



126SaSS2 c3 7^ 



£.9e~37 



Protein name 



Locus Name 



sp:MMSR_PSEAE 



ACC# 



P28809 



Description 
MM^AB OPERON REGULATORY PROTEIN 



NT 



AA 



ORF Name 



NTID 



AAID 



14284 



35US" 



Length Length 



Score Probability 



[TuTT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



20.I17.G£..±i...2 1 



l,5e-84 



Protein name 



Locus Name 



L-arabinose transport (permease) araE 



pir :F6y587 



Acc# 



F69587 



Description 



1119 



NT 



AA 



ORF Name 



NT ID 



I246S1W7 ±2 14 



VIST 



AAID Length Length 

mm — 



iio2 I izim 



Score Probability 
|b.4e-217 



Protein name 



Description 



Locus Name 



sp:YKKSJ300Ll 



Acc# 



P76585 



HYPOTHETICAL 127.3 KB PROTEIN IN CSIg-CLyA IMTEftflBNIG fttidloM 



ORF Name 



NTID 



NT AA 
T — ^ T — ^ Score Probability 
AAID Length Length J - 



|3406$43£ ±1 3 



li.Se-18 



Protein name 



Locus Name 



beta-gaiactosidase 



gp:AFt)S5482 



Acc# 



AF0 554 82 



Description 



Thermotoga neapolitana galactose utilization operon, completesequence . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length - — - J ~ 



9510 



9.5e-14 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-like protein 



gp:AF137263 



Acc# 



AF137263 



Description 



Bacteroictes tnetaiotaomicron 3 OS ribosomal protein S16 -likeprotexn, fucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



l&lll43L2..±2...2a I 



AAID Length Length 



W5T 



Score Probability 

o.oois 



Protein name 



Locus Name 



BcDNA.UHllSVJ 



gp:AP14567± 



Acc# 



AF145671 



Description 



Drosophila meianogaster clone GH113 7 3 BcDNA. QHllb>V3 (bcDNA.GHlliJ 73)mRNA, 
complete cds. 



1120 



NT 



AA 



ORF Name 



NTID 



AAID 



120895303 r2 21 



'4290 



Length Length 
T2TT 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2im2...cA...&a 



S5TX 



Length Length 



2445 



Score Probability 
5T5 



1.2e-46 



Protein name 



Locus Name 



receptor antigen (RagA) 



|gp:PGI130872 



Acc# 



AJ13 08 72 



Description 



Porphyromonas gingivalxs W50 receptor antigen (rag) locus encodinga ma]or 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



AAID 



NT AA 

— ^ T — ^ Score Probability 
Length Length ^ 



i2411S3.fiL±I..lS I 



WIT 



7T 



0.0064 



Protein name 

Description 
NODULATION PROTEIN NOLP 



Locus Name 



sp:NOLP_RHlLJ> 



Acc# 



P23717 



NT 



AA 



ORF Name 



NTID 



AAID 



4293 



19515 



Length Length 
S3 



Score Probability 



Protein name 
Description 

wo-hit — : 



Locus Name 



Acc# 



1121 



NT 



AA 



ORF Name 



NTID 



26594067 cl 19 



AAID Length Length 
— 



Score Probability 
TS5 



Protein name 



Locus Name 



transmembrane sensor 



gp:AJ?'0yi6<)i 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A (pstAj , ECF sigma tactor (tiul) , 
transmembrane sensor (fiuR) , and hydroxamate- typef errisiderophore receptor 
(fiuA) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



4103577 ±3 30 



Length Length 
TH5~ 



Score Probability 



Protein name 

Description 
IMO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



liaaiflLS£a..±i...4flL. I 



Length Length 

K&n — 



Score Probability 
8.5e-2S5 



Protein name 



Locus Name 



neuraminidase precursor 



gp : BNRNANA^Jil 



Acc# 



D28493 



Description 



Bacteroides tragilxs nanH gene tor neuraminidase, complete cds. 



ORF Name 



NTID 



AAID 



NT AA 
„ — , - — , Score Probability 
Length Length • — 



148.7.b.3.0.0....c:2....S.D. I WITT 



9519 



g. 7 e-0B 



Protein name 



Locus Name 



unknown 



tap:U9&77i 



ACC# 



U96771 



Description 



Prevotella bryantii putative .polygalacturonase , B-l, 4-endoglucanase , and 
mannanase genes, complete cds; and unknowngenes . 



1122 



NT 



AA 



ORF Name 



162676Ja c2 67 



NT ID AAID Length Length 




Score Probability 



ST 



Protein name 

Description 
NO- HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2Lisa5&&3L„±aL...jLa i 



Length Length 



Score Probability 
4 . 8e-84 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



2.2.8.f5.15.6.2...jfc.2...2.8... 



AAID Length Length 




Score Probability 
TTTJI 



|1.5e-lii 



Protein name 



Locus Name 



hypothetical protein TM1624 



pir:H7i>228 



Acc# 



H72228 



Description 



NT 



AA 



ORF Name 



NTID 



2.&2.5.63.&2..X2....XZ... 



AAID Length Length 

v&n — 



Score Probability 
T25 



li.Be-06 



Protein name 



Locus Name 



unknown 



|gp:U%771 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase , B-l, 4 -endoglucanase , and 
mannanase genes, complete cds; and unknowngenes . 



1123 



ORF Name 



NT ID 



NT AA „ _ , , . . . . 
— • — Score Probability 
AAID Length Length 



242596^ f2 27 



691 



rrrr 



8 . le-35 



Protein name 



Locus Name 



sialic-acid o-acetyiesterase 



gp:MM'O404O8 



Acc# 



U40408 



Description 



Mus musculus lysosomal sialic acid o-acetylesterase mRNA, completecds . 



ORF Name 



NTID 



NT AA „ _ , , . - . . 
— — , Score Probabi lity 
AAID Length Length 



24413577 t:i 33 



TUT 



l.2e-07 



Protein name 



Locus Name 



unknown 



gp:t»677i 



Acc# 



U96771 



Description 



Prevotella bryantn putative polygalacturonase , b-i , 4 -endoglucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



Protein name 



NT 



AA 



\lll29A16....c±...5.± ......J F^M 



NTID AAID Length Length 

mzz — 



Score Probability 



1ST" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — ■ Score Probability 
AAID Length Length 



9527 



S7T 



12022 



Locus Name 



Acc# 



Description 



NO-HIT 



1124 



NT 



AA 



ORF Name 



NTID 



AAID 



'24545937 t3 41 



Length Length 
JSTD" 



TZUT 



Score Probability 

05 — 



\'2 . 6e-06 



Protein name 



Locus Name 



sp:J>AlU_RAT 



Acc# 



035264 



Description 

ACTIVATING FACTOft A(iBTYLHH)RoLASB AL&HA 2 aUBUNX T) (PAff-AH ALPHA a) 



NT 



AA 



ORF Name 



2772937 c2 59 



NTID AAID Length Length 

— 



1093 1 13252 



Score Probability 
13 . Oe-89 



£23 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



, JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
i.2e-54 



ITU 



Protein name 



Locus Name 



Acc# 



alpna-L- fucosidase, 1 precursor, 
tissue : alpha-L- fucosidase I : alpha- L-fucoside 
fucohydrolaae . — 



pir:HWHUFA 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3.Ml£l2.?...±l...ll I ffT^ 



Length Length 



Score Probability 
|9.2e~26 



Protein name 

Description 
(BETA-WAHASE) 



Locus Name 



sp:HEXA_PORGI 



Acc# 



P49008 



ORF Name 



135351593 c3 124 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



rftttt 



AAID Length Length 
-$532 



Score Probability 



IT 



r 7TE~ 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
(BETA-NAHASE) 



NT 



AA 



NTID 



AAID 



WJTT 



Length Length 



Score Probability 
TTT3 — 



1.8e-I15 



Locus Name 



sp:HEXA_F>0k<41 



Acc# 



P49008 



ORF Name 



Protein name 



unknown 



Description 



NT 



AA 



NTID 



AA&A6.&.1...L1...11 I 



AAID Length Length 
5533 — 



T5W 



Score Probability 
TT5 



l.Se-06 



Locus Name 



gp:TO677l 



Acc# 



U96771 



Prevotella bryantix putative polygalacturonase ,B-l, 4-endoglucanase , and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID 



£lLMM2lxi1.-&4 .J WTTE 



Length Length 
TTT 



Score Probability 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



1126 



ORF Name 



NTID 



NT AA 
T — x — Score Probability 
AAID Length Length 



781932 tl 4 



1118 I [7XS7 



rarer 



5. ye-82 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130872 



Acc# 
AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



5FT7 



TTUT 



Score Probability 




5.ie-§5 



Protein name 



Locus Name 



115K outer membrane protein precursor :SusC 
protein 



pir : JC6027 



Acc# 



JC6 02 7 



Description 



ORF Name 



NTID 



NT AA 

„ „ ^ ^ — r — Score Probability 
AAID Length Length — ^ 



9538 



1120 I 13363 



|2.6e-79 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6 02 7 



Description 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



1127 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
5540 



1158 



Score Probability 




4 ; ve-44 



Protein name 



Locus Name 



probable nagA protein 



pir:C7084b 



Acc# 
C70845 



Description 



ORF Name 



NTID 



NT AA 

— _ — ^, Score Probability 
AAID Length Length JL 



1197 



i.8e-35 



Protein name 



Locus Name 



hypothetical protein bl325 



pir:H648Sl 



Acc# 



H64881 



Description 



ORF Name 



NTID 



|U&5£2£3...±l..A I PT^T 



Protein name 



Hypothetical protexn 



Description 



NT 



AA 



AAID Length Length 




Score Probability 
ff73 



Locus Name 



pir:A72430 



2.7e-48 



Acc# 



A72430 



1 ORF Name 



NT 



AA 



Protein name 
Description 



NTID AAID Length Length 

v&n — 



Score Probability 



TIT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



\llA16h.6..±lJ15. I 



AAID Length Length 



TTTT 



Score Probability 
3.8e-18 



VT5 



Protein name 



Locus Name 



polysugar degrading enzyme nomolog ykt'cT 



pir :A69856 



Acc# 



A69856 



Description 



1128 



ORF Name 



NT ID 



ci 62 



TTZT 



Protein name 



9545 



hypothetical protein phu^iv 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



TTT 



Locus Name 



pir:G71244 



I.5e-06 



Acc# 



G71244 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA „ _ , , . _ . . 
— , — , Score Probability 
Length Length 



4324 



2.3e-58 



Locus Name 



Acc# 



sp:HTRAJi!cJoLl 



PROTEASE DO PRECURSOR, 



w 



ORF Name 



NTID 



2.46.422.11.,tz...3.u | WTZ5 



Protein name 



AAID 



NT 



AA 



Length Length 
TUT 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



\2&6£5M.1..±2J20... 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 



phosphate transport system regulator Phou 



Description 



Score Probability 



Locus Name 



bir:S7^275 



Acc# 



G72275 



ORF Name 



2Z$.112t2.±±..± 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length — — 



WTTT 



2TTT 



Locus Name 



Acc# 



Description 



MO-HIT 



1129 



NT 



AA 



ORF Name 



NT ID 



126367177 c3 110 



AAID Length Length 
1320 



955T7 



Score Probability 
1253 



I2.2e-15> 



Protein name 



Locus Name 



sensory protein Kinase 



pir :T30222 



Acc# 
T30222 



Description 



NT 



AA 



ORF Name 



NTID 



|29.3.15.2£..±2„.3.1 ...J POTS 



AAID Length Length 
3551 — 



793" 



Score Probability 




i.3e-12 



Protein name 



Locus Name 



clostripain-relatecl protein 



pir:147235i 



Acc# 



B72351 



Description 



NT 



AA 



ORF Name 



NTID 



lCxl$Al±l..±l...$± I liTJTF 



AAID Length Length 
5552 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



[ ¥T3T" 



^ — ^, T — ^ Score Probability 
AAID Length Length JL 

5553 — 



TOT 



5uT" 



5TT 



1.2e-§3 



Protein name 



Locus Name 



&igA 



|gp:OTU6771S 



Acc# 



U67718 



Description 

Cniorobium tepiaum sigA (sigAj gene, complete cas. 



1130 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 




Score Probability 
ll.8e-35 



Protein name 

Description 
RIBOFLAVIN ^YNTJHLA^Ji: ALPHA 



Locus Name 



Acc# 



sp:RI^A__BA<J^U 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



IKfC-HM 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— , — , Score P robability 
Length Length 



TIT 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



16A101Zl.±±..A& 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



189 



ITT 



4,2e-iS 



Protein name 
Description 

PU T ATIVE NAD (^)M N I TR0REI1D U C T A£3 R YDG1, 



Locus Name 



Acc# 



P96707 



1131 



NT 



AA 



ORF Name 



NTID 



AAID 



U65251S7 ±2 19 



Length Length 
ITT 



Protein name 



phospnate transport ATP binding protein 



Description 



Score Probability 

— 



Locus Name 



p?Tr7 



'2.4e-47 



Acc# 



G70390 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



sp:RBN_HAEIS 



Acc# 



P44608 



RI BONUCLE AS E BN, (RNASE BN) 



If! 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ~ .—■ J - 



i Aaai£Stt...c;i...m.. j wjjz 



RT7T" 



Protein name 

Description 
IN0-H1T 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
T T — . , Score Probability 

Length Length 



14333 



Protein name 

Description 
IWO-MIT 



Locus Name 



Acc# 



1132 



NT 



AA 



ORF Name 



NTID 



AAID 



78155 ci 67 



14340 



Length Length 



Score 



7X1 



Probability 
|2.3e-35 



Protein name 



Description 



Locus Name 



sp:PHOPJ4A0yU 



Acc# 
P13792 



E>H0£> 



ORF Name 



NTID 



AAID 



NT AA 

- — , — , Score Probability 
Length Length • 



15103300 ci 56 



14541 



75" 



O.OlS 



Protein name 



Locus Name 



response regulator 



gp:AFl30W7 



Acc# 



AF130997 



Description 



Enterococcus taecxum strain BM43^y vanD giycopeptide resistancegene 
cluster, complete sequence. 



NT 



AA 



ORF Name 



NTID 



15..7.26.3....C.3....5.2. I RET¥2 



AAID Length Length 
9564 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length : 



^5F5~ 



11723 



2.3e-177 



Locus Name 



hypothetical protein 



bir:J01020 



Acc# 



JQ1020 



Description 



ORF Name 



NTID 



15832885 t3 33 



Protein name 



AAID 



nypothetical protein 



Description 



NT 



AA 



Length Length 



Score Probability 
7TS 



Locus Name 



pir: J01020 



i3.1e~ 7 0 



Acc# 
JQ102 0 



ORF Name 



NTID 



iaafi£3...±3L...2£x i 



Protein name 



NT AA 
T — T — Score Probability 
AAID Length Length i ~ 



Locus Name 



Acc# 



Description 

no-hit : 



ORF Name 



NTID 



NT AA 
_ — ^, _ — ^, Score Probability 
AAID Length Length JL 



20.0.a.7.7.5.1...cl...6.3. 



[3TT 



6 . Be-146 



Protein name 



Locus Name 



putative UDP-GlcNAc : undecaprenyiphosphate 



gp : AF048749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



ORF Name 



NTID 



NT AA 
_ — _ _ — _ Score Probability 
AAID Length Length — 



20A0.S.3.Z...G2...5.2..... I 



fZET 



1ZT 



TOT" 



i.0e-16 



Protein name 



Locus Name 



putative giycosyi transterase 



gp:LPN7311 



Acc# 



AJ007311 



Description 



Legionella pneumophila serogroup l lipopolysaccharide biosynthesisgene 
cluster. 



1134 



NT 



AA 



ORF Name 



|iS055416i ci 61 



XTmTT ^ ™-r^ _ — T — ^, Score Probability 
NTID AAID Length Length * L 

SFTO — 



POTS 



9T~ 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



226.5.Mb.u...cI...M I 



AAID Length Length 

9571 



IT 



Score Probability 




2 . 7e-l0 



Protein name 



Locus Name 



arylsultotranst erase 



gp:AFi^01 



Acc# 



AF126201 



Description 



Pseudomonas putida strain S-313 sulfate ester desuiturization genelocus, 
complete sequence. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length — • £ ~ 



l 2iafi£ti2ifiL...ci..Aa I 



ST" 



0,031 



Protein name 



Locus Name 



sp:SP£C__XENLA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (ON) (BASEMENT MEMBRANE PROTEIN BM-40) 



NT 



AA 



ORF Name 



NTID 



T33T" 



AAID Length Length 
1252 



Score Probability 
R53 1 10.031 



Protein name 



Locus Name 



sp:SPkCJU!!NLA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (ON) (BASEMENT MEMBRANE PROTEIN BM-40) 



ORF Name 



24239253 £1 7 



Protein name 



NTID 



AAID 



NT AA 

— ■ , — , Score Probability 
Length Length 



3W 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



CapSJ 



Description 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TUET 



TIT 



9 . 7e-08 



Locus Name 



gp: SAUtil973 



Acc# 



U81973 



Staphylococcus aureus capsule gene cluster CapBA through CapBPgenes, 
complete cds . 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 

— 



Score Probability 



T5T 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



NT 



AA 



NTID 



l iiaafiaai„.ai...ia | 



AAID Length Length 



Score Probability 
85 



0.010 



Protein name 



Locus Name 



Acc# 



hypothetical protein, 57.8 KD 



|gp : r>OLi>45436 



Description 



Pseudomonas puticla OCT plasmid. a lk genes cluster ana tiankmg DNA, strain 
TF4-1L. 



1136 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length J - 



3314076 c2 46 



Protein name 



probable lipopolysaccnancte O-side chain 
biosynthesis protein (O-antigen transpoter) 



Description 



Locus Name 



|pir:F711b2 



6.0e-19 



Acc# 



F71152 



NT 



AA 



ORF Name 



NTID 



AAID 



3.3.4.6.D.9.5.2...c2...5ti.. 



Length Length 
T7T7 



Score Probability 
75 



9 . 5e-05 



Protein name 

Description 
DJWA-BIMDING PROTEIN HRLbi 



Locus Name 



sp;DBH5_RHILE 



Acc# 



P02348 



NT 



AA 



ORF Name 



NTID 



AAID 



3£A7.0.9.S.7...±3....27. I 



7SSTT 



Length Length 



TFT" 



Score Probability 

inns 



7T 



Protein name 

Description 
HIPB PROTEIN 



Locus Name 



isp:HIPB_ECOLI 



Acc# 



P23873 



NT 



AA 



ORF Name 



NTID 



AAID 



3.aD..7.1^1...tl...XX I 14359 



9581 



Length Length 
WO 



Score Probability 



2*T 



Protein name 

Description 
INC -HIT : — 



Locus Name 



Acc# 



1137 



ORF Name 



Protein name 

Description 
NO -HIT 



NT 



AA 



NT ID 



AAID 



Length Length 



Score Probability 



T3T 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



l41D.3.3.&7....al„.3.S. I WJZT 



9b83 



probable rhamnosyitransl erase 



Description 



NT AA n , _ , n , 
T — ^. — * Score Probability 
Length Length 



Locus Name 



|pir:H7559S 



Acc# 
H75596 



ORF Name 



Protein name 

Description 
IWO-HIT 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



WT 



Locus Name 



Acc# 



ORF Name 



Protein name 



unknown 



NT 



AA 



NTID 



AAID 



RT3TTT" 



Length Length 
T7 - 



Score Probability 
0.023 1 



37 



Locus Name 



gp:APi34706 



Acc# 



AF134706 



Description 

Sinorhizobium meliloti insertion sequence ISRml4, completesequence . 



1138 



ORF Name 



NTID 



MI M score Probability 



I419842S c'A 47 



AAID Length Length 
TFT 



T37T 



i.6e-0B 



Protein name 



Description 



Locus Name 



gp:AB000222 



Acc# 



AB000222 



Staphylo coccus capitis epr gene , complete ccts . 



NT 



ORF Name 



NTID AAID Length Length 



— Score Probability 



4704715 ci 62 



TT5TT 



4 . le-148 



Protein name 

Ur)P-glucose-4-epimerase/aTDP-giucose-4, b 



Locus Name 



|gp:AF04874y 



Acc# 



AF048749 



Description 

Bacteroides tragilis capsular poly saccharide biosynthesis operon, complete 



sequence . 



ORF Name 



Protein name 



NT 



5.9.m^±l^l I 



NTID AAID Length Length 

TU5 



— Score Probability 



Locus Name 



Acc# 



Description 



N0-U1T 



ORF Name 



5.3..7.^.kb....al^iJ.. 



Protein name 



NTID 



AAID 



14367 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



IN0-H1 T 



1139 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

rm — 



Score Probability 
|3.Se-26 



295" 



Protein name 



Locus Name 



glycosyi trans t erase 



pxr :GV55yb 



Acc# 



G75596 



Description 



O 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



ii&ai*£fitt...ta...itt™ i 



1330 



1.0e-i35 



Protein name 



Description 



Locus Name 



Acc# 



|sp:ISTB__BACFk | Q45120 



I NSERTION SEQUENCE IS21-LIKE PU T ATIVE A TP-BINDING PROTEIN 



NT 



AA 



ORF Name 



NTID 



[¥T7u~ 



AAID Length Length 
— 



Score Probability 
|7.4e-m 



TF7T- 



Protein name 



Locus Name 



lsp:TRA2JBA<JFR 



Acc# 



Q45119 



Description 

T RANS POS&SE F OR I NSER T ION S EQUENCE ELEMENT 1^2 i -LIKE 



NT 



AA 



ORF Name 



NTID 



AAID 



WT7T 



Length Length 

rm — 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



ino-mj:t 



1140 



NT 



AA 



ORF Name 



NTID 



22460186 ±2 6 



AAID Length Length 
Bl 1 |T^ 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2&<n.7.M3...±2...4 1 P7I 



Length Length 
2TT" 



Score Probability 
TTu"5 



2 . ve-112 



Protein name 



Description 



Locus Name 



sp : TRA2_BACFR 



Acc# 



Q45119 



TRMSPCSASE FOR INSERTION aEQUEMcjJhl KLEMEOT i'S2i-LIRE 



ORF Name 



NT AA 

, Tmn .^ ™™ v — ^ — Score Probability 
NTID AAID Length Length : z - 



aas252£a..±a...i | wtm 



TTT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



fZTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1141 



NT 



AA 



ORF Name 



NTID 



AAID 



row 



— _ , — _ Score Probability 
Length Length 

— 



i. 3e-i4y 



Protein name 

Description 
AMIMOfRANggKRAgg) 



Locus Name 



sp:BIOA HAEIN 



Acc# 



P44426 



NT 



AA 



ORF Name 



NTID 



126SS437 c3 302 



AAID Length Length 
— 



11470 



Score Probability 

— 



|4.9e-56 



Protein name 



Locus Name 



immunoreactive 53 JcD antigen PG123 



gp:AF144£4i 



. Acc# 
AF144641 



Description 



Porpnyromonas gingival is strain W5 0 immunoreactive 53 KD antigenPG123 gene, 
complete cds . 



NT 



AA 



ORF Name 



I2LM.7..7..7..7...t2...9.a 



NTID AAID Length Length 




Score Probability 



F7T 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



13Ltt2tt45a..±a...l54 I RTT73" 



Length Length 



Score Probability 

m, 



10.019 



Protein name 
Description 

(EC 5. 2.1.8> (PPIASE) (ROTAMASE ) 



Locus Name 



Acc# 



|sp:FKBA_ECOLI 



1142 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



13835937 12 100 



14390 





9502 




454 1365 789 



Protein name 



Locus Name 



dihyd.ro! xpoamicle 
dehydrogenase, : 2-oxoglutarate dehydrogenase 
complex chain F3:acetoin dehydrogenase complex 



pir : !4U'/y4 



Description 



2.2e-78 



Acc# 



140794 



ORF Name 



14095406 cl 182 



Protein name 



NTID 



[4381 



AAID 



NT AA „ „ , , , , . ^ 
— , — , Score Probability 
Length Length ^ 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



i4.7.mn...ai...iai i m&z 



Protein name 



AAID 



NT AA „ ^ _ ■ , _ . ^ 
— , — , Score Probability 
Length Length 



T3T 



Locus Name 



Acc# 



Description 
KO-HIT 



ORF Name 



NTID 



AAID 



14.7.3..7.5.0.7....c2...22l I 



Protein name 



hypothetical protein APE1673 



Description 



NT 



AA 



Length Length 
TIT 



Score Probability 
TTT~ — 



Locus Name 



pir :E72548 



Acc# 



E72548 



1143 



NT 



AA 



ORF Name 



15040893 13 137 



— _ — , Score Probability 
NT ID AAID Length Length L 

vzuz — 



3132 



TIT 



|1.7e-120 



Protein name 



Locus Name 



receptor antigen (RagA) 



Acc# 



AJ130872 



Description 



Porphyromonas gingivaiis W50 receptor antigen (rag) locus encoclinga major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



AAID 



16832885 13 155 



Protein name 



hypothetical protein 



Description 



NT 



AA 



Length Length 



Score Probability 
T&T — 



Locus Name 



|pir: J01020 



1.6e-75 



ACC# 
JQ1020 



ORF Name 



ll.7.S3.I7..„G3....2.a2.„ 



Protein name 

Description 
NO-HIT 



NTID 



AAID 



NT 
Length 



AA 
Length 



Score Probability 



73" 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



X3.W.6.2....t±..A& [ WW7 



Length Length 



Score Probability 



fZZT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



1144 



NT 



AA 



ORF Name 



NT ID 



'205S2W0 c3 V21 



t — . ^ T — ^_ Score Probabi lity 
AAID Length Length i - 

^TT3 



|4.Se-8i 



Protein name 



Locus Name 



Salmonella typhimurium transcriptional 



gp : STy$TMJ?l 



Acc# 
AF170176 



Description 

Salmonella typhimurium tragment STMF1, 



ORF Name 



NTID 



AAID 



NT AA 

— ^ — ^ Score Prob abi lity 
Length Length ■ J - 



120750302 il 44 



T5T 



11380 



|6.3e-71 



Protein name 

Description 
CHAIN TRASS ACVIASE) 



Locus Name 



sp:0DB2_BA«U 



Acc# 



P37942 



NT 



AA 



ORF Name 



NTID 



AAID 



21^.7.I0.16....c2...2S^ I 



Length Length 
SB 



Score Probability 



Protein name 

Description 
IMO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— ' — Score Probability 
AAID Length Length ^ 



miixusa. i ftst 



2 . 3e-26 



Protein name 



Locus Name 



gp:AB0230S4 



Acc# 



AB023064 



Description 

Listeria monocytogenes DNA tor DnaK operon, complete cds. 



1145 



ORF Name 


NT ID 


7V 7\ TT\ 

AA1JJ 


NT 


AA 
Length 




Score 


Probability 


22860128_jtlJ^ 


4392 


9614 


83 


252 






0.031 


Protein name 








Locus 


Name 


Acc# 












__XENLA 


P36378 


Description 
















(OSTEONECTIN) (oJsi) 


(BASEMENT 


MEMBRANE 


PROTEIN BM-40) 










ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


23851527__c3_3l2 


4393 


96l5 


405 


1218 




1026 


1.7e-103 



Protein name 



Description 



Locus Name 



sp:MOFJblAUilvj 



Acc# 



P44422 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1652 



TTT" 



i.2e-06 



Protein name 



Locus Name 



unKnown 



| gp:U%771 



Acc# 



U96771 



Description 



E>revotelia bryantix putatxve polygalacturonase, B-l, 4-endoglucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



\2±6a±$.i2.±±..ao. I 



NTID AAID Length Length 

mr? — 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



1146 



ORF Name 



NTID 



NT AA 

- — ^ — ^ Score Probabi lity 
AAID Length Length ■ JL 



124644015 c3 296 



F2W 



7.4e-16 



Protein name 



Locus Name 



prolidase 



gp:AB014613 



Acc# 



AB014613 



Description 



Aureobacterium esteraromaticum gene tor prolidase, complete cds , 



NT 



AA 



ORF Name 



NTID 



AAID 



I24S45252 t3 123 



Length Length 
T325 — 



Ml 



Score Probability 
T7Z 



i.^e-45 



Protein name 



Locus Name 



immunoreactive 50kD antigen PG53 



gp:AF175720 



Acc# 



AF175720 



Description 



Porpnyromonas gmgivaiis strain W50 immunoreactive 50KD antigenPG53 gene, 
complete cds. 



NT 



AA 



ORF Name 



Z&6.fm&Q..7....a3....3.Ql.. 



NTID AAID Length Length 




Score Probability 



TW 



Protein name 



Locus Name 



Acc# 



Description 
BTO-HIT 



ORF Name 



NTID 



14355 



Protein name 



hypothetical protein A556L 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



0.036 



Locus Name 



pir :T1805S 



Acc# 



T18058 



1147 



NT 



AA 



ORF Name 



NT ID 



AAID 



25448546 rl 18 



TZTT 



Length Length 



Score Probability 
1.5e-54 



Protein name 



Locus Name 



|sp:YFOS_METJA 



Acc# 



Q58903 



Description 

HYPOTHETICAL ASC TRANSPORTER AH>- BINDING PROTEIN MJi50§ 



13 



ORF Name 



NTID 



AAID 



NT 
Length 



AA 
Length 



Score Probability 



255315 cl 162 



4401 



T£7T 



7T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — ^ 



16.16MM...±2„.$A I M 



"515" 



TWIT 



1.3e-15S 



Protein name 



Locus Name 



propionyl-CoA carboxylase 



gp:AB007600 



Acc# 



AB007000 



Description 



Myxococcus xanthus MxppcB gene tor propionyl-CoA carboxylase , complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



9625 



Length Length 
53TJ 1 



TIT 



Score Probability 
^UT ' 



5 ,6e-48 



Protein name 



Locus Name 



sp:BID2_HAEIN 



Acc# 



P45248 



Description 

2) (MB SYNTffilTA&W 2) (DTBS 2} 



1148 



NT 



AA 



ORF Name 



^ , -r^ — , — _ Score Probability 
NT IP AAID Length Length x 

ff¥03 — 



Protein name 



acetyl -CoA carboxylase (biotin carboxylase 
subunit) accC 



Description 



T7TT 



Locus Name 



pir : A69581 



i.2e-132 



Acc# 
A69581 



NT 



AA 



ORF Name 



NT ID 



2^M425...±2...£S. I 



4405 



AAID Length Length 
9627 



FH7T 



Score Probability 
[2T3 



b.6e-15 



Protein name 



Locus Name 



hypothetical protein aq_294 



pir:H70326 



Acc# 



H70326 



Description 



NT 



AA 



ORF Name 



aiaftiafta..,fi...ii4 i w 



NT ID AAID Length Length 

9628 



ITS" 



T5T 



Score Probability 



TOT 



Protein name 



Locus Name 



Hypothetical protein APE1466 



pir :B72626 



Acc# 



B72626 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



9629 



Length Length 
TZ5 



Score Probability 
|2.2e-30 



Protein name 
Description 

PROBABLE LIJWATU-PROTEIN LIGA^ A, 



Locus Name 



IsprLPLAJWPN 



Acc# 



P75394 



1149 



NT 



AA 



ORF Name 



NTID 



AAID 



22116703 c2 26S 



Length Length 



[68 



Score Probability 
TT5 



4.4e~09 



Protein name 



Locus Name 



hypotneticai protein APE2 061 



pir :G72bIU 



Acc# 
G72510 



Description 



ORF Name 



NTID 



NT AA 
v — _ T — ^_ Score Probability 
AAID Length Length ^ 



3.Z2.2.6.5.^a...t3....L5.2... 



T7T" 



l.ie-33 



Protein name 



Locus Name 



Acc# 



rlavodoxm 



pir :A28670 



Description 



ORF Name 



NTID 



iafiii4ii...ti...i2fi I ests 



Protein name 



conserved nypothetical protein 



Description 



NT 



AA 



AAID Length Length 




Score Probability 
|2.0e-09 



T7T 



Locus Name 



|pir:G723&S 



Acc# 



G72385 



ORF Name 



Protein name 

Description 
MTETT 



NT 



AA 



NTID 



AAID Length Length 

— 



Score Probability 



1542 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



NT AA 
T — . -i — ^, Score Probability 
AAID Length Length JL 



4412 



9634 



T¥3~ 



5.8e-14 



Locus Name 



methylmalonyl-coa decarboxylase gamma chain 
PAB1771 



E 



ir:P75135 



Acc# 



F75135 



Description 



1150 



NT 



AA 



ORF Name 



NTID 



14413 



AAID Length Length 

mrz — 



Score Probability 



POT 



TTOT 



Protein name 

Description 
IMO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



[4414 



AAID Length Length 
9636 



Score Probability 
12^8 



2 .2e-44 



Protein name 
Description 

PUTATIVE BIOTIN SYNTHESIS PROTEIN BIOC 



Locus Name 



sp:BIOC_HAEIN 



Acc# 
P45249 



NT 



AA 



ORF Name 



NTID 



3.3.2L3.0.3.i...cl...20.l I [23TS 



AAID Length Length 
SFT7 — 



T7T 



tut 



Score Probability 
25S 



6 .5.e-22 



Protein name 



Locus Name 



membrane protexn 



pir :G64590 



Acc# 



G64590 



Description 



NT 



AA 



ORF Name 



NTID 



3.%&8A'£5...XX..3. 



AAID Length Length 



TZUT 



Score Probability 
856 



l .7e-85 



Protein name 



Locus Name 



aspartate aminotransterase 



pir :D72220 



ACC# 



D72220 



Description 



1151 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



4337*482 ±3 Ibl 



2088 



4.4e-103 



Protein name 



Locus Name 



proJoatoXe (pyruvate) oxoisovalerate 
dehydrogenase alpha and beta fusion 



bir:G71526 



Acc# 
G71526 



Description 



NT 



AA 



ORF Name 



\&l6.2L±b..±±..A I FTTF 



NTID AAID Length Length 




11317 



Score Probability 
233 



|4.4e-23 



Protein name 



Locus Name 



sp:YCFW_ECOLI 



Acc# 
P75958 



Description 

HYPOTHETICAL 45.3 KD PkOTEIN IN MFb-COBB IMTEkGENIC REGION 



NT 



AA 



ORF Name 



NTID 



&±&ao:i:l.zijxio. i pro 



AAID Length Length 
TZ%% — 



9641 



Score Probability 
T73 



|6.1e-35 



Protein name 



Locus Name 



Hypothetical protein APE188 7 



pir :G72575 



Acc# 



G72575 



Description 



ORF Name 



NTID 



\4.&m6£l...clJ10.1 I mzu 



Protein name 



L- lactate permease (IctP) homo log 



Description 



NT 



AA 



AAID Length Length 



T5TT 



Score Probability 
331 



l>7e~94 



Locus Name 



pir : q70175 



Acc# 



C70175 



1152 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TUT" 



TUT" 



1.0e-05 



Protein name 



Description 



Locus Name 



sp : m'TJWkdl 



Acc# 



P43158 



THIOL PROyBASfi/HjaMAaaLtTPlMIM E>kTT ftkECURSOR, 



NT 



AA 



ORF Name 



NTID 



4957582 c2 281 



AAID Length Length 
— 



EST" 



Score Probability 
— 



|7.8e-l6 



Protein name 



Description 



Locus Name 



sp:CHAC_S&HHE 



Acc# 
Q59288 



tCHONDROITIN SULFATE LYASE) (CHONDROITIM AC ELIMINASE) 



NT 



AA 



ORF Name 



NTID 



aiasia2...c2...aj5 i 



AAID Length Length 
— 



127S 



Score Probability 
|4.7e-.26 



Protein name 



Locus Name 



Acc# 



sensor protein pilS 



pir:S705i!8 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



>5.11$£1...CX.:±1& I [44^ 



Length Length 
T37T3 — 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1153 



NT 



AA 



ORF Name 



54802 c2 264 



4425 



NTID AAID Length Length 

— 



Protein name 



NADH dehydrogenase, .-protein slr0851 .-protein 
slr0851 



Description 



TTTT 



Score Probability 
513 



Locus Name 



pir :S74826 



6.4e-63 



Acc# 
S74826 



ORF Name 



NTID 



NT AA ^ , . , , 
„ — v — _ Score Probability 
AAID Length Length JL 



6.28A6.5....CX..17.2. 



4426 



6 . ye-34 



Protein name 



Locus Name 



sp:YJV3_ YEAST 



Acc# 



P40896 



Description 

HYPOTHETICAL 35. ^ KD PROTEIN IN HXT8-CRT1 INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



8.a2.9.12....£l..Al I 1442 7 



Length Length 



Score Probability 



TT5BT 



Protein name 

Description 
IMFTTTT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
r^t-u t ™^h Score Probability 
Length Length 



ai42fli,.c2i...2z& I 



TTTT 



2.9e-09 



Protein name 



Locus Name 



Acc# 



EpsG 



gp:AF03648b 



Description 

Piasmid pNZ4000, complete sequence. 



1154 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Pro babi lity 
Length Length J - 



1172330 tl 2 



1.4e-106 



Protein name 



Locus Name 



Ketol-acid reductoisomerase 



gp:PSP16743 



Acc# 



Y16743 



Description 



Piromyces sp . E2 mRNA tor ketoi-acxd reductoisomerase. 



ORF Name 



NTID 



20443775 cl 23 



Protein name 



hypothetical protein T18E12.6 



Description 



NT 



AA 



AAID Length Length 




1533" 



Score Probability 
~l |d.2e-71 



7T7 



Locus Name 



pir :T02699 



Acc# 



T02699 



NT 



AA 



ORF Name 



NTID 



AAID 



Protein name 
Description 

m&rgTr— 



2M48.40.2..±3....i£. I mJT 



Length Length 



Score Probability 



1977 



Locus Name 



Acc# 



ORF Name 



ii&:ws±i..±±..± 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



AAID 



9654 



Length Length 



Score Probability 



Locus Name 



Acc# 



1155 



NT 



AA 



ORF Name 



NT ID 



11*4017827 ±2 12 



— — n Score Probability 
AAID Length Length • L - 



2 . be-15 



Protein name 



Locus Name 



palmitoyl-acyl carrier protein thioesterase~ 



|gp:AF034266 



Acc# 
AF034266 



Description 



Gossypium hirsutum palmitoyl-acyl carrier protein tmoesterase (FatBl) mRNA, 
partial cds. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



2¥J7TT7TTl~~9" 



9656 



I73TT 



fZTTT 



|A.5e-245 



Protein name 

Description 
HYDkO-LYASE) (ACOMITASE) 



Locus Name 



sp:AC0Nj3&AVS 



Acc# 



P49609 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



TFT 



TUT 



a.8e«ll 



Protein name 



Locus Name 



acetolactate synthase 



pir :E70459 



Acc# 



E70459 



Description 



ORF Name 



NTID 



4s.ma;>,±i...io. I [44^ 



9658 



Protein name 



isocitrate dehyrogenase 



NT 



AA 



AAID Length Length 




Score Probability 
3F7 



Locus Name 



gp:BIISOCIT 



Description 

Bacillus Israeli isocitrate dehydrogenase gene. 



3.5e-d8 



ACC# 



Y13358 



1156 



NT 



AA 



ORF Name 



NT ID 



AAID 



9659 



Length Length 
HOT" 



57F" 



Score Probability 
I.7e-62 



Protein name 



Description 



Locus Name 



gp:ABU22 8 6 7 



Acc# 
AB022867 



Prevotella ruminicola genes tor poiyA 
and cellulase, complete cds. 



polymerase , D-alanineglycinepermease 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length ~ JL 



13S37557 c3 75 



1 [FFOT 



T2T 



TTT 



TTT 



■S.3e-07 



Protein name 



Description 



Locus Name 



gp:M2ECWM 



Acc# 



M36913 



z.mays cell wall protein mRNA, T 1 end. 



ORF Name 



NTID AAID 



NT AA 
, — L1 — ^ Score Probability 
Length Length JL 



|lM&iD.0.2..±2...16. I F^TS 



3 . 3e-22 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI040S 



Locus Name 



sp : YEBA_HAEIN 



Acc# 



P44693 



ORF Name 



NTID 



AAID 



NT AA 

— . — , Score Probability 
Length Length ^~ 



184 



555" 



Protein name 



Locus Name 



4 -methyl -5 (b- hydroxy ethyl ) -thiazole 
monophosphate biosynthesis protein (thiJ) 



pxr :D70177 



Acc# 



D70177 



Description 



1157 



ORF Name 



NTID 



NT AA 

_ ^ — ^ — ^ Score Probability 
AAID Length Length JL 



4441 



TOT 



|3.Be-33 



Protein name 



Locus Name 



probable nucleoside -diphosphate Kinase, 



pir :C71116 



Acc# 



C71116 



Description 



NT 



AA 



ORF Name 



4442 



NTID AAID Length Length 

9664 



TITT 



!T3T" 



Score Probability 
531 



[2.le-57 



Protein name 



Description 



Locus Name 



Igp : PGPUT 



Acc# 



X97228 



P. gingival is gpdxJ, put, and yhbG-pg genes . 



ORF Name 



NTID 



Z6.Z0,^iai...C3....6.1 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
— 



Score Probability 
992 



6 . 7e~ioa 



Locus Name 



pir : JQ102 0 



ACC# 
JQ1020 



ORF Name 



2£.7.5.S.3.8.7...±2....17... 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



4444 



9666 



Length Length 

— 



Score Probability 



444 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



1158 



ORF Name 



NTID 



14446 



Protein name 



conserved hypothetical protein yacM 



Description 



NT 



AA 



AAID Length Length 




57T 



Score Probability 
2 . 2e-30 



TTB" 



Locus Name 



pir :S661iy 



Acc# 



ORF Name 



NTID 



NT AA 
v — _ _ — Score Probability 
AAID Length Length -L 



iiiaam..±2L...ia | F¥T7 



TOT* 



TOT" 



|3.5e-57 



Protein name 



Locus Name 



triosephosphate isomerase 



gp:AF043386 



Acc# 



AF043386 



Description 



Clostridium acetobutylicum glyceraldehyde- 3 -phosphate dehydrogenase (gap) , 
phosphoglycerate kinase (pgk) , and triosephosphate isomerase (tpi) genes, 
complete cds; and 2 , 3-bpg- independent phosphoglyceratemutase (pgm-i) gene, 
partial cds. 



i 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ^ 



1%A16£.25...±2..±5.. I HWff 



TUT 



7.6e-13i 



Protein name 

Description 
AT Jr> -DEPENDENT DNA HELICASE REOT, 



Locus Name 



sprftECfijSYtWa 



Acc# 



Q55681 



ORF Name 



NTID 



AAID 



NT AA 
„ — , — , Score Probability 
Length Length — — 



267 



tut 



i.9e-06 



Protein name 



Locus Name 



hypothetical protexn PHS004 



bir:j?'7i245 



Acc# 



F71245 



Description 



1159 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



35205387 r3 



Probability 
|6.5e-22 



Protein name 



Description 



Locus Name 
sp: T OMi*_NiiilcJu 



Acc# 



006432 



TONS ERO'l'fclltt 



NT 



AA 



ORF Name 



NTID 



AAID 



4155587 ±3 24 



14451 



Length Length 

wzz — 



Score Probability 



TIT 



|2.0e-66 



Protein name 



Locus Name 



pyrxdoxal phospnate syntnetase 



gp : E>C^UT 



Acc# 



X97228 



Description 

P.gingivaiis gpdxJ , put, ana ymotj-pg 



genes , 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



|iS.7.8A6.i..±l...b.., 



Probability 

li.6e-oy — 



Protein name 



Description 



Locus Name 



sp:TOLk_UAElM 



Acc# 



P43769 



TOLR frftOTJ i ilJsl- 



NT 



AA 



ORF Name 



NTID 



AAID 



1T5T 



W7"5~ 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



1160 



ORF Name 



NTID 



NT AA 

„ TT% T — ^ — _ Score Probability 
AAID Length Length JL 



15676 



7^" 



2 . 3e-19 



Protein name 



Locus Name 



hypothetical protein 



gp:PST24 3 3 54 



Acc# 



AJ2433 54 



Description 



Pseudomonas stutzeri hypl and comA genes and putative tolQ, exbB,tolR and 
exbD genes . 



ORF Name 



10137 c2 219 



Protein name 



NTID 



M31T 



carbonic anhydrase homolog ytiB 



Description 



NT 



AA 



AAID Length Length 
9677 



Score Probability 



537 



Locus Name 



pir : F69993 



l.le-Sl 



ACC# 



F69993 



ORF Name 



NTID 



10.6.3.226,a...al...3.21 1 



Protein name 



AAID 



9678 



NT 



AA 



Length Length 
S3 - 



Score Probability 



TUT 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



10.6..7.5.6.a0....a2...M£ I (3^7 



Protein name 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
WO-HIT 



1161 



NT 



AA 



ORF Name 



NTID 



AAID 



12600340 £3 137 



Length Length 
IWB 1 



Score Probability 



Protein name 

Description 
JNO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\±26.16M±..±±..A 1 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



1N0-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



461 



i.6e-56 



Protein name 



Locus Name 



Algl 



|gp:PAU50202 



Acc# 



U50202 



Description 



Pseuclomonas aeruginosa alginate gene cluster Algl (algl) , AlgJ" (aigj; and 
AlgF (algF) genes, complete cds . 



NT 



AA 



ORF Name 



ma5ia&..±2.,.iii I F^r 



NTID AAID Length Length 

19683 



Score Probability 



FIT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



1162 



ORF Name 



NT ID 



NT AA 

_ _ T — ^ — _ Score Probability 
AAID Length Length JL 



I4S41001 t2 i*2 



Protein name 



thiamin biosynthesis protein homolog 



Description 



mr 



Locus Name 



pir :H69260 



i.7e-28 



Acc# 



H69260 



ORF Name 



NTID 



NT AA 
_ — — Score Probability 
AAID Length Length — 



1368 



3 . 7e-07 



Protein name 



Locus Name 



KIAA1275 protein 



gp:AB033101 



Acc# 



AB033101 



Description 



Homo sapiens mRNA tor KIAA12 7 5 protein , partial cds. 



NT 



AA 



ORF Name 



NTID 



U6£0.28.:/....g3....MA .1 



AAID Length Length 

— 



Score Probability 
416 



6 . 4e-67 



Protein name 



Locus Name 



outer membrane protein 



gp:BNEOMPB 



Acc# 



L77614 



Description 



Bacteroides tnetaiotaomicron outer membrane protein (susD) gene , complete 
cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



mi2L5i&.„ci...a£a | 



9687 



Length Length 
TTT 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
NO-HIT ' 



1163 



ORF Name 



14^4712 fl 4^ 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length — ^ 



T21T 



Locus Name 



S.Oe-OS 



Acc# 



Description 



sp:MEXR_PSEAE 



MULTIDRUG RESISTANCE 0PER0N REPRESSOR 



P52003 



ORF Name 



161015S7 tl 50 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J - 



4467 



1080 



Locus Name 



1.7e-37 



Acc# 



|sp:BMKA_HAKIH 



Description 
MULTIDRUG RESISTANCE RROTKIN A H0M0L0G 



P44928 



ORF Name 



NTID 



NT AA 

„ _ _ T — T — _ Score Probability 
AAID Length Length — 



ifia^aaa...c2...2a5 1 imp? 



Protein name 



hypothetical protein 



Description 



TTZT 



2 . 3e-l77 



Locus Name 



pir : JQ1020 



Acc# 



JQ1020 



ORF Name 



lg.3.0.5...±2.„..7.5. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT ~ 



1164 



ORF Name 



20015643 c3 312 



Protein name 

Description 
INO-MT 



NT 



AA 



NT ID AAID Length Length 

— 



Score Probability 



sir 



JUT 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
(EC i. -.-.-) 



NT 



AA 



NTID 



AAID 



Length Length 



F7X 



Score Probability 




J.2e-20 



Locus Name 



sp : YM6 7__ARCFU 



ACC# 



028017 



ORF Name 



NT 



AA 



NTID 



|20.S.0.S.S.3.0....cl...l&4 1 W%T2 



AAID Length Length 




12865 



Score Probability 
355 — ~ 



3.3e-28 



Protein name 



alpha-amylase, precursor :protexn C062 0 



Locus Name 
[pir:573057 



Acc# 



S73087 



Description 



ORF Name 



NTID 



NT AA 

„ , ^ „ — ^, — ^. Score Probability 
AAID Length Length JL 



^22B3£&±..±1„±1& I WFn 



9695 



Protein name 



Locus Name 



thiol zdisuitide interchange protein homolog 
yneN 



pir :E69891 



Description 



4 . oe-iu 



Acc# 



E69891 



1165 



NT 



AA 



ORF Name 



NTID 



22459655 C3 360 



T — _ T — _ Score Probabil ity 
AAID Length Length z - 

9696 



TIT 



TTTT5" 



14 .7e-06 



Protein name 



Locus Name 



transmembrane sensor 



gp:AF05I6M 



Acc# 



AF051691 



Description 



Pseuaomonas aeruginosa stress factor A {psxA; , ECF sigma ractor (fiul) , 
transmembrane sensor (fiuR) , and hydroxamate- typef errisiderophore receptor 
(fiuA) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



2247S431 ti §4 



AAID Length Length 
9697 



1ST 



1154 



Score Probability 




l.le-23 



Protein name 



Locus Name 



sp:YRK0J3ACSU 



Acc# 



P54442 



Description 

HYPOTHETICAL 45.4 KD PROTEIN IN BLTR-SPOIIIC INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



4476 



AAID Length Length 
9698 



Score Probability 
770 



l l.Se-113 



Protein name 



Locus Name 



cytosolic phosphoglycerate Kinase 1 



gp:AB01S410 



Acc# 



AB018410 



Description 



Populus nxgra PnCytPGKl mRNA for cytosolic phosphoglycerate kxnasel, 
complete cds . 



NT 



AA 



ORF Name 



\22&£A±2B....al..3A0. I wm 



NTID AAID Length Length 

— 



TIT 



Score Probability 




0.031 



Protein name 



Locus Name 



sp:SPRC__XENLA 



ACC# 



P36378 



Description 

(OSTEONECTIN) (ON) (HA&EMENT MEMBRANE PROTEIN HM-40) 



ORF Name 



NT ID 



_ _ __ T — ■ T — _ Score Probability 
AAID Length Length 



23554553 ±2 81 



Protein name 



endonuciease III 



Description 



TIT 



] 



Locus Name 



pxr:B715i3 



b.0e-4 7 



Acc# 
B71919 



ORF Name 



NT ID 



NT AA 

— , v — , Score Probability 
AAID Length Length 1 ^ 



Protein name 



4479 



] 



rrnr 



Locus Name 



0.0059 



Acc# 



Description 



P18275 



MGININE/ORMITHINE AMTIP0RTJ2R 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



Protein name 



TIT 



tt&t 



TO" 



Locus Name 



|4.2e-46 



Acc# 



115K outer membrane protein precursor ; SusC 
protein 



pir : JG6027 



Description 



JC6027 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — - — : 4 - 



[ l&0±65.25....c2..25.9. I [S¥ST 



TIT" 



1032 



TT7¥" 



Protein name 



Locus Name 



3.4e-119 



Acc# 



|sp:ALFJTOEF>A 



Description 



083668 



NT 



AA 



ORF Name 



NTID 



'24017303 cl 242 



AAID Length Length 

mm — 



£7T 



2034 



Score Probability 
TS&l — 



4 . 6e~298 



Protein name 



Locus Name 



puilulanase 



gp:BTU67061 



Acc# 



U67061 



Description 



Bacteroides thetaiotaomicron puilulanase (pull; gene, complete cds . 



ORF Name 



NTID 



NT AA 

_ _ T — _ T — Score Probability 
AAID Length Length ^~ 



24041626 13 161 



nr 



l.Se-06 



Protein name 



conserved nypotnetical protein MTH83 



E 



Locus Name 
ir.-F«210 



Acc# 



F69210 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — — 



Protein name 



TTT 



Locus Name 



Acc# 



Description 
NO-HIT ~~ 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — 



2i41.7.0..7.S..±3....15a I 



Protein name 



TF7~ 



"TOT - 



Locus Name 



4 . oe-31 



Acc# 



Hypothetical protein PH0272 



Description 



pir:A714S2 



A71452 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



T2W 



Score Probability 
S.0e-08 



T57 



Protein name 



Locus Name 



conserved hypothetical protein BB0195 



pir :C70124 



Description 



Acc# 



G70124 



1168 



NT 



AA 



ORF Name 



NTID 



AAID 



4487 



Length Length 



11332 



Score Probability 

wn — 



Protein name 



Locus Name 



antibiotic resistance protein homolog ywoG 



Description 



pxr:B70065 



i,7e-39 



ACC# 



B70065 



NT 



AA 



ORF Name 



NTID 



AAID 



1AL^116..±2J1L I 



9710 



Length Length 



Score Probability 




2.8e-78" 



Protein name 



Description 



Locus Name 



Acc# 



lsp:SYJ?'A BACSU 



-TRNA LIGA^E ALPHA CHAIN) (PHERS) 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



25&3A3J3....&2„.2±6. , I 14489 



vr 



Protein name 
Description 

ro^rrr 



Locus Name 



Probability 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
Length Length 



Score Probability 



2^M3.fL..a3....3.5.& I 



9112 



Protein name 

Description 
WTO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



1^78^7 7 £2 5^ 



WJTT 



Length Length 
TJT 



Score Probability 
1.8e-16 



Protein name 

Description 
HYPOTHETICAL PkOTWIN 



Locus Name 



|sp:Y798_METJA 



Acc# 
Q58208 



NT 



AA 



ORF Name 



NT ID 



£2 &3 



AAID Length Length 
WTTZ — 



Score Probability 




|6.4e-51 



Protein name 



Description 



Locus Name 



ABOlSSVtf 



Acc# 



AB019578 



Microcystis aeruginosa mcyA, mcyB and mcyC genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



I2fi3.atas2..±i...2a I 



S7TT 



Length Length 
T7T 



Score Probability 



WIT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



m7.I0A7....ci....A15... 



7T 



Protein name 

Description 
[MO-HIT 



Locus Name 



Acc# 



1170 



ORF Name 



Protein name 

Description 
MO-HIT 



NT 



AA 



NT ID 



AAID 



WTTT 



Length Length 
TF3 



Score Probability 



60 



Locus Name 



Acc# 



ORF Name 



m.7.£a2s...ti...aa 



Protein name 

Description 
BTO-HIT 



NT 



AA 



NT ID 



AAID 



Length Length 
F^S — 



Score Probability 



28b 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NT ID 



aiaSL45a3....c2...2L4J. I IW7 



AAID Length Length 

Tm — 



7TT 



Score Probability 
503 



4 . ye-48 



Protein name 



Locus Name 



115K outer memJorane protein precursor : SusC 
protein 



pir: JCS027 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



Aia7.&fift7...±i...ai i 



Protein name 



acyl carrier protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



Locus Name 



pir :S28475 



i.4e-05 



Acc# 



S28475 



1171 



NT 



AA 



ORF Name 



NT ID 



32620812 f2 108 



AAID Length Length 
— 



Score Probability 
14. 8 e -10 



T7F 



Protein name 



Locus Name 



VceB 



|gp:AF012101 



Acc# 



AF012101 



Description 



Vibrio cholerae ettlux gene A (vceA) and ettlux gene B (vceB) multidrug 
resistance pump genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



3377027 cl 213 



AAID Length Length 
TTTT 



Score Probability 
TT5 " 



|i.3e-06 



Protein name 



Locus Name 



Hypothetical protein 



pir :F72216 



Acc# 



F72216 



Description 



NT 



AA 



ORF Name 



NTID 



3A4.1&U5.Z...C.3....3.3.3. 



AAID Length Length 

— 



F7F" 



Score Probability 
!>.5e-25 



Protein name 



Locus Name 



conserved hypothetical protein aq__^l7l 



pir :D70486 



ACC# 



D70486 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3.5.S.Za^6.1...c3....3.16.., 



5724 



Length Length 



Score Probability 



Protein name 
Description 

ino-hit : — ~ 



Locus Name 



Acc# 



1172 



NT 



AA 



ORF Name 



NT ID 



'363S0ff00 tl 112 



„^ TT ^ T — s_ — Score Probability 
AAID Length Length JL 

9725 



HT75~ 



T2r 



0 . 00057 



Protein name 



Locus Name 



Acc# 



unKnown 



gp:AF013216 



Description 



Myxococcus xanthus Dog (dog) , isocitrate lyase (ici; , Mis (mis) ,Uro Into) , 
fumarate hydratase (fhy) , and proteosome major subunit (clpP) genes, complete 
cds; and acyl-CoA oxidase (aco) gene, partial cds. 



NT 



AA 



ORF Name 



NT ID 



AAID 



551540 cl 2ui 



4504 



Length Length 
^1— 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



#1 



ORF Name 



NTID 



NT AA 

_ _ T — T — ^. Score Probability 
AAID Length Length * c 



A0££.&±6..±1..±6A I 



TTTT 



1863 



Protein name 



Locus Name 



neopuiiuianase 



|gp:BTU66897 



Acc# 



U66897 



Description 



BacteroicLes tnetaiotaomicron neopuiiuianase (susA) andalpha-glucosidase 
(susB) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



410.i0.b.2..±l...SS. I 



AAID Length Length 
19728 



Score Probability 
2¥2 



2 . 0e~20 



Protein name 



Locus Name 



pro£>a£>le rlbosomal protein L31 



(pxr:T36353 



Acc# 



T36353 



Description 



1173 



NT 



AA 



ORF Name 



NTID 



AAID 



'4144002 cl 243 



TFZT 



Length Length 
IS73 



pr 



Score Probability 
i.0e-25 



1253 



Protein name 



Locus Name 



RNA polymerase sigma factor SigZ-like protein 



gp:AF1372W 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ribosomal protein slS-liJteprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



4172012 Cl 223 



AAID Length Length 
— 



Score Probability 
W&L 



6.0e-44 



Protein name 



Locus Name 



endo-beta-galactosidase 



gp:AF0838&6 



Acc# 



AF083896 



Description 



Fiavobacterium Keratoiyticus endo-beta-galactosidase gene, compietecds . 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length ^ 



&3.2.Q3.13....r.2L...22 



4509 



T7TT 



(1634 



6 .2e-168 



Protein name 



Locus Name 



methylmalonyl - coA decarboxylase , alpha chain 



pir;A4^094 



Acc# 



A49094 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



|4&M.3.B.7....12...iaO. I [^TU 



9732 



Length Length 



Score Probability 




|2.0e-20 



Protein name 



Locus Name 



glutaconyl-CoA decarboxylase gamma subunit 



bp:AP0a0B76 



Acc# 



AF030576 



Description 



Acidaminococcus termentans methylmalonyl -CoA decarboxylase alpnasubunit 
(mmdA) gene, partial cds,* and glutaconyl-CoA decarboxylasedelta subunit 
(gcdD) , glutaconyl-CoA decarboxylase gamma subunit (gcdC) , and glutaconyl-CoA 
decarboxylase beta subunit (gcdB) genes , complete cds. 



1174 



NT 



AA 



ORF Name 



NTID 



\492206 ±1 SB 



AAID Length Length 

— 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



i! H 
1 J 



■'lit !f# 



NT 



AA 



ORF Name 



NTID 



iM51S.7....c2L...24,a I 



AAID Length Length 
9734 



Score Probability 
~ 



0.0015 



Protein name 



Locus Name 



outer membrane protein 



gp : BNROMPA 



Acc# 



L77615 



Description 



Bacteroides thetaiotaomicron outer membrane protein (susE) gene, complete 
cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



■Zll&l£R..£&.£dl I F^TT 



Length Length 



or 



Score Probability 
TJ5 " 



2.1>e~20 



Protein name 

Description 
MAP PROTEIN 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



ima2L&3„..c2L...2iaft I fot 



AAID Length Length 



Score Probability 
4.8e-2S 



Protein name 



Locus Name 



crossover junction enctodeoxynbonuc lease 



pir :B72360 



Acc# 



B72360 



Description 



1175 



NT 



AA 



ORF Name 



5890712 t3 171 



NTID AAID Length Length 

— 



Score Probability 



WTJT 



P5T 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length 

I^TS 1 — 



Score Probability 



[3715" 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



a:/.:7.2Laia...t2L...n& .....i 14517 



AAID Length Length 




Score Probability 



TUT" 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length 

ftsts 1 mrv 1 n^i — 1 nnnj2 — 



Score Probability 



Protein name 

Description 
INO-HIT — 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

, , ^ T — — Score Probability 
AAID Length Length JL 



^Mmo...±3....i6.D. 1 mr$ 



1062 



'3.8e-37 



Protein name 



Locus Name 



oxaloacetate decarboxylase, beta subumt 



Acc# 



B72324 



Description 



1176 



ORF Name 



NTID 



4 52 0 



Protein name 



beta-galactosiclase , 



Description 



NT 



AA 



T — T — ^ Score Probability 
AAID Length Length JL 

TT& 



Locus Name 



5 . Be-06 



Acc# 



T29434 



ORF Name 



Protein name 



NTID 



|147.2fllft2L..±l...li I F^T 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



TFT 



Locus Name 



Acc# 



Description 
MO-HIT 



NT 



AA 



ORF Name 



NTID 



Ii8A5.a&5...±2...3.5. I W5T2 



AAID Length Length 
TTTZ 



73T" 



Score Probability 
OT 



i.3e~06 



Protein name 



Locus Name 



M-protein 



gp:SEU73162 



Acc# 



U73162 



Description 



Streptococcus equi M-protem (seM) gene , complete cds. 



ORF Name 



NTID 



NT AA 
T — _ T — Score Probability 
AAID Length Length J - 



isaftafl&a...aa...iaa.... i fbtt 



Protein name 



Locus Name 



Acc# 



Description 
KfO-MIT 



NT 



AA 



ORF Name 



NTID 



1S604818 ci 71 



AAID Length Length 

mzz — 



7WT 



TTZT 



Score Probability 
^T5T5 



4.7e-15 



Protein name 



Locus Name 



Acc# 



colicm I receptor 



gp:ECOCIR 



Description 



E.coii colicxn I receptor gene, complete cds . 



NT 



AA 



ORF Name 



1706S>£>7£ tl IS 



NTID AAID Length Length 

9747 



T5T5 



1 YT&I 



Score Probability 

1257 



5.1e-22 



Protein name 



Locus Name 



Acc# 



IsprYJJajECdlil 



Description 

HYPOTHETICAL 25.3 KD PROTEIN IN RIMI-PRFC INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



id.ibaida.±±..± 



14526 



Length Length 
FT™ 



Score Probability 
0. 021 



FT 



Protein name 



Locus Name 



Acc# 



glutamyl -tRNA reductase 



gp:AF08006^ 



Description 



Chlorobium vibriof orme glutamyl -tRNA reductase (nemA) gene , complete cds; 
and porphobilinogen deaminase (hemC) gene, partialcds. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length — J ~ 



TTT 



TTZT 



i.ie-41 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6 02 7 



Acc# 



JC6027 



Description 



1178 



ORF Name 



124225385 ±2 38 



Protein name 

Description 
NO-HIT 



NT 



AA 



NT ID 



AAID 



4528"" 



y/50 



Length Length 
73 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



24A22.5.Q5...±1...13. I 14529 



Protein name 



hypothetical protexn 



Description 



NT 



AA 



Length Length 



Score Probability 

m 



Locus Name 



0.0045 



Acc# 



T10699 



NT 



AA 



ORF Name 



NTID 



AAID 



15.B.0A6m..^2...&A I F^TT 



Length Length 
TTT 



Score Probability 
F£"8 



5.7e-55 



Protein name 



Locus Name 



thymxctxne kinase 



gp:AF028720 



Acc# 



AF028720 



Description 



RhocLothermus sp . 1 ITI 518 ' thymidine kinase (tdk) gene, completecds , 



ORF Name 



NTID 



NT AA 
T — _. — _ Score Probability 
AAID Length Length 



3.3A3.y.uia...ci...63. 



2844 



|2.3e-97 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:E»<3H30672 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen (rag) locus encoclinga ma] or 
immunodominant 55kDa antigen. 



ORF Name 



3417^52 cl 64 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



1614 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



S7TT5 - 



conserved Hypothetical protein 



Description 



NT AA 

— . , — i _ 1 Score Probability 
Length Length ^ 



TZU" 



S.7e~55 



Locus Name 



Acc# 



D72343 



NT 



AA 



ORF Name 



NTID 



l$±lM2...al.A± 



AAID Length Length 




1524 



Score Probability 

— " 



i.^e-14 



Protein name 



Locus Name 



unknown 



gp:U96771 



Acc# 



U96771 



Description 



Prevoteila bryantii putative polygalacturonase , B-l , 4-endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID 



14535 



TTET 



Length Length 
— 



Score Probability 



Protein name 

Description 
(NO-HIT 



Locus Name 



Acc# 



1180 



NT 



AA 



ORF Name 



4351417 cl 6S 



NT ID AAID Length Length 

— 



3~3ir 



Score Probability 
TUT5 



1.4e-26 



Protein name 



Locus Name 



probable permease perM nomoiog (perM) RP630 



pir:E71668 



Acc# 
E71668 



Description 



NT 



AA 



ORF Name 



NTID 



m3..7..7.&..±2....2.i I 



AAID Length Length 
— 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



8 W 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
9760 



5¥T 



Score Probability 
628 



2 ,8e-95. 



Protein name 



Locus Name 



enclo-1, 4-beta-xylanase, 



pir ;T30909 



Acc# 



T309.09 



Description 



nit lis; 1 

lit"! 

IJfl Wr 



NT 



AA 



ORF Name 



NTID 



lAa-.7.fi5Qi„.ti...a 



AAID Length Length 



Score Probability 

242, 



2.0e-20 



Protein name 



Locus Name 



regulatory protein pchR- 2 : protein 
slrl489 :protein slrl489 



pir :S74456 



Acc# 



S74456 



Description 



ORF Name 



Protein name 

Description 
M6-HIT 



NTID 



AAID 



NT 



AA 



Length Length 
71— 



Score Probability 



Locus Name 



Acc# 



1181 



NT 



AA 



ORF Name 



NTID 



16914125 13 55 



4541 



AAID Length Length 
— 



TIT 



Score Probability 

— 



3.0e-33 



Protein name 



Locus Name 



Acc# 



032244 



Description 

HYPOTHETICAL 22.6 Kf> PROTEIN IN OPUCA-BllStO nWfiRflfiHie REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



F7TT 



Score Probability 
TT7 



.6e-0§ 



Protein name 

Description 
PRECURSOR 



Locus Name 



sp:YN23_YEAST 



Acc# 



P53832 



NT 



AA 



ORF Name 



NTID 



AAID 



14 5 4 3"™ 



3*7^5- 



Length Length 



Score Probability 



Protein name 

Description 
IMO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — ■ Score Probability 
Length Length — ^ 



.lftS.7.adlS..±2...2fl I 14544 



1350 | 



T31T 



I2.0e-41 



Protein name 



Locus Name 



adenylate cyclase homolog 



pir :T17197 



Acc# 



T17197 



Description 



1182 



m 



NT 



AA 



ORF Name 



10737500 rJ 4^ 



NTXD AAID Length Length 

— 



4545 



Score Probability 
ITS 



y . 3e-08 



Protein name 



Locus Name 



AnsH phosphatase 



|gp : gCAHBJTOCT 



Acc# 



AF131879 



Description 



Streptomyces colimus ansatrienin AHBA biosyntnetic gene clusterregion 2, 
complete sequence. 



ORF Name 



NTID 



NT AA 
_ — —. — ^, Score Probability 
AAID Length Length JL - 



14546 



37T" 



335" 



i.8e-Sl 



Protein name 



Locus Name 



nypothetical protein 



pir : JQ1020 



Acc# 
JQ1020 



Description 



NT 



AA 



ORF Name 



NTID 



ii&maas...ti...ia 



AAID Length Length 
5755 



Score Probability 

rm — 



2.ie-I77 



Protein name 



Locus Name 



nypothetical protein 



pir :JQ102U 



Acc# 



JQ1020 



Description 



ORF Name 



liamsfl3L..±i...a„ 



Protein name 

Description 
[NO- HIT 



NTID 



AAID 



14548 



WTTT 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



1183 



NT 



AA 



ORF Name 



NTID 



1960876 rl 2 



AAID Length Length 
TSTJl — 



Score 



FOTT 



Probability 
10.0055 



Protein name 



Locus Name 



putatxve glucosyi hydrolase precursor 



Acc# 



AF047839 



Description 



Pseudoalteromonas sp. S9 putative glucosyi hydrolase precursor andadaptive 
response regulatory protein (ada) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



196S&77& c3 S7 



WT7T 



Length Length 
TOT" 



Score Probability 
T&S 



6.6e-i-2 



Protein name 



Locus Name 



MsmR 



gp:SPU4«$7 



Acc# 



U4 93-97 



Description 

Streptococcus pyogenes MsmR (msmRJ gene, partial cds; LepA (lepA) , Cpa 
(cpa) , and Nra (nra) genes, complete cds; SsbA (ssbA) gene, partial cds; 
unknown genes. 



and 



NT 



AA 



ORF Name 



NTID 



AAID 



mrr 



Length Length 



JOT 



Score Probability 
TSu 



4.7e-3 : 5 



Protein name 



Locus Name 



hypothetical protein PAB0790 



pir:H7b098 



Acc# 



H75098 



Description 



ORF Name 



Z2.8.f5.D.l,2.8....£.X...3.., 



Protein name 



NTID 



NT 



AA 



AAID Length Length 

wrn — 



WT 



Score Probability 
0 . 031 



55 



Locus Name 



Description 

(OSTEONECTIN) (ON) (BASEMENT MEMBRANE PkOTEIN BM-40) 



Acc# 



P36378 



1184 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

wr7*> — 



83 



T5T 



Score Probability 




.0.031 



Protein name 



Description 



Locus Name 



sp :SPKC_XENLA 



Acc# 



P36378 



(0STS0MEOT1N) (ON) (BASEMENT MfiMB&ANE PROTEIN feM-40) 



ORF Name 



NTID 



AAID 



NT AA 

— . — _ Score Probability 
Length Length 



I24^2^6'77 c2 64 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



C3 



NT 



AA 



ORF Name 



NTID 



2S.a.7.0.lO...±2..13. I 



AAID Length Length 
T7T7 — 



Score Probability 
213 



Protein name 



Locus Name 



Hypothetical protein phiiov 



Ipir:£)7l051 



Acc# 



D71051 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



a5.7.a&i5s...ta...3Lfi I w^z 



TT7T 



Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1185 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length i ~ 



33442 ±1 5 



WTTT 



390 


1173 




153 



Protein name 



Locus Name 



transcription regulator Arac/xyis ramiiy 
homo log ydeE 



|pir:G6S777 



Acc# 
G69777 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



liS.2^0.7....cl...5.D.... I 



T7W 



Length Length 



Score Probability 
1 . 6e-06 



ITS 



Protein name 



Locus Name 



transposase 



gp:AF028866 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn5520 
protein BmpH (bmpH) genes, complete cds 



transposase (JoipH) andmoJDilization 



ORF Name 



NTID 



AAID 



NT AA 
t ^f-v, t ™<-v. Score Probability 
Length Length 



3.5.5.5.5.ia2...a3....B.D.. 



9781 



3HT 



TTT~ 



1.2e-05 



Protein name 



Locus Name 



transposase 



Acc# 



AF038866 



Description 



Bacteroides tragilxs transposon Tn5520 
protein BmpH (bmpH) genes, complete cds. 



transposase (bipH) andmobi lization 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTWT 



157 | 


474 




116 





|4.5e-07 



Protein name 



Locus Name 



Hypothetical protein 8 



pir ;E69183 



Acc# 



E69183 



Description 



1186 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

mm — 



11977 



Score Probability 
211 



|4.7e-34 



Protein name 



Locus Name 



sialic acicL-specitic 9-0-acetylesterase 



IgpiMMASSOA 



Acc# 



X98625 



Description 



M.musculus mRNA tor sialic acict-specitic 9-o-acetylesterase . 



ORF Name 



NTID 



NT AA 

_ _ _ _ — _ _ — _ Score Probabilit y 
AAID Length Length ■ JL 



\469±662 cl 53 



— i rr^ 



50" 



0 . 00020 



Protein name 



oligopeptide ABC transporter, ATP-JDincling 
protein 



Description 



Locus Name 
[pir:£)722$3 



Acc# 



D72289 



ORF Name 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
715 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



IAI1D.0.1O....O....47. I 



Length Length 



Score Probability 



Protein name 

Description 
FTTTTT 



Locus Name 



Acc# 



1187 



NT 



AA 



ORF Name 



NTID 



AAID 



4554552 c2 65 



Length Length 
73 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



BABJ£.2.±2J21 I 



T7W 



Length Length 

— 



Score Probability 
J> . Ue-40 



Protein name 



Description 



Locus Name 



Acc# 



'sp:YJV8_YEAST | P40892 



(EC 2.5.1. -J 



NT 



AA 



ORF Name 



NTID 



AAID 



5fill7Al.±l..il I 



Length Length 
T77T 



Score Probability 
3.0e-i7 



Protein name 



Locus Name 



hypothetical protein TM0383 



(pir:S72353 



Acc# 



G72383 



Description 



ORF Name 



NTID 



\6MA5.0.5..±1J29. ...J F^FS 



Protein name 

Description 
NO-HIT 



AAID 



9790 



NT 



AA 



— ', — , Score Probability 
Length Length — 

2FTT 



TFT" 



Locus Name 



Acc# 



1188 



ORF Name 



7031556 ±1 11 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
[NO-HIT 



ORF Name 



a£££1.7...±3L...3.5. 



Protein name 



NTID 



NT AA. 
T — T — Score Probability 
AAID Length Length — JL 



SET 



|4.4e-40 



alpha -glucosidase 



Locus Name 
|gp:BT066ffg7~ 



Acc# 



U66897 



Description 



Bacteroides thetaiotaomicron neopullulanase (susA) andalpha- glucosidase 
(susB) genes, complete cds . 



ORF Name 



NTID 



Protein name 



NT AA 
T — T — ^, Score Probability 
AAID Length Length — * L ~ 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



ixa7.4flL&i...ti...i3..„ I W^ri 



Protein name 



AAID 



NT AA 
— — Score 
Length Length 



Locus Name 



Probability 



Acc# 



Description 
MO-HIT 



1189 



NT 



AA 



ORF Name 



193757 ±2 27 



NTID AAID Length Length 

— 



4573 



Score Probability 
^175 



9.3e-S3 



Protein name 



Locus Name 



H5K outer membrane protein precursor : SusC 
protein 



pir : JC6 02 7 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID AAID Length Length 

9796 



Score Probability 
ib.9e-i&3 



pteft 



Protein name 



Locus Name 



immunoreactive 8 7kD antigen PG92 



lgp:AP175724 



Acc# 



AF175724 



Description 



Porphyromonas gingival is strain W5 0 immunoreactive 8 7KD antigenPG92 gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



2i±:m.B.i..±ijn i f^ts 



tw 



PTE" 



l . 4e-l7 



Protein name 



Locus Name 



RNA polymerase ECF-type sigma tactor sigw 



pir :H69706 



ACC# 



H69706 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



22£&am...tL..7. I 



Length Length 



Score Probability 



TuW 



Protein name 

Description 
IKT0-H1T 



Locus Name 



Acc# 



1190 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAIP Length Length JL 



24023387 t2 12 



T7W 



Protein name 



2520 



TTuT" 



Locus Name 



l.ae-lli 



Acc# 



putative secreted £>eta-galactosiclase 



Description 



gp:SCF81 



AL133171 



Streptomyces coelicolor cosmid F81. 



ORF Name 



NTID 



AAID 



NT AA 
„ — ^ ^ — , Score Probability 
Length Length 



I2421SS62 ±1 5 



Protein name 



1373" 



Locus Name 



i.3e~30 



Acc# 



hypothetical protein 



Description 



bir:S74053 



S76053 



NT 



AA 



ORF Name 



NTID 



AAIP Length Length 



Score Probability 



2MSA6.S.I..±2...2.D. I 



T70T" 



T7TT 



y .6e-i7y 



Protein name 



Locus Name 



ABC transporter (ATP-bmding protein; nomolog 
ykpA 



Description 



pir ;E69861 



ACC# 



E69861 



ORF Name 



Protein name 



NTID 



AAID 



NT 
Length 



j.m x AA 

— , — , Score Probability 
"""" Length J - 



9802 



57T 



Locus Name 



Acc# 



Description 
INO-HI T 



NT 



AA 



ORF Name 



NTID 



4581 



AAID Length Length 
TY513 — 



Score Probability 
TTOT 1 ll.iie-132 



Protein name 



Locus Name 



immunoreactive neat snock protein DnaJ 



|gp:AF14579V 



Acc# 



AF145797 



Description 



Porphyromonas gingivalxs strain W5 0 immunoreactive heat shocJtprotem DnaJ 
gene, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



34181503 tl 2 



9804 



SIT 



Protein name 



Locus Name 



outer membrane protein 



gp : BNROMPB 



Acc# 



L77614 



Description 



Bacteroides thetaiotaomicron outer membrane protein (susD) gene , complete 
cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



19805 



Length Length 
FT" 



Score Probability 



Protein name 



Locus Name 



Acc# 



pi 



Description 
NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



3.y.7.$..7.£0A.±2...U 



AAID Length Length 



muz 



Score Probability 
407 



l.ie-70 



Locus Name 



115K outer membrane protein precursor ; SusC 
protein 



pir:JC6027 



Acc# 



JC6027 



Description 



1192 



ORF Name 



NTID 



NT AA 

_ ^ — _ — _ Score Probability 
AAID Length Length JL 



36360255 £2 26 



1.9e-22 



Protein name 



Description 



Locus Name 



sp : PLC_BACCE 



ACC# 
P14262 



{&H0S&HATI&VLIM0SIT6L-SMC1PIC &H0SfcH0LI&ASfi 0) (£>1-£>LC) 



ORF Name 



36540523 r2 25 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 

srs — 



Score Probability 



85 



Locus Name 



Acc# 



Description 



9t m 



GRF Name 



4:&am,xA...aa„. 



Protein name 



NTID 



surtace antigen BspA 



Description 



NT 



AA 



AAID Length Length 

— 



T5T 



Score Probability 
5.7e~07 



15? 



Locus Name 



pir :T31094 



ACC# 



T31094 



ORF Name 



NTID 



Aa.7.afi2L3...±i„.a i fsto 



Protein name 



AAID 



probable transmembrane protein 



Description 



NT AA 

— . — , Score Probability 
Length Length 



Locus Name 



pir :T3465i 



3.1e-22 



Acc# 



T34651 



1193 



NT 



AA 



ORF Name 



NT ID 



AAID 



I6283ib0 il 1 



Length Length 



Score Probability 
2 . 2e-78 



1W5 



Protein name 



Locus Name 



immunoreactive 8 7KD antigen PG92 



gp;AF17B724 



Acc# 



AF175724 



Description 



Porphyromonas gmgivaiis strain W50 immunoreactive 87KD antigenPG92 gene, 
complete cds. 



NT 



AA 



ORF Name 



NTID 



6640682 CI 55 



14590 



AAID Length Length 
WT2 — 



Score Probability 
1.6e-27 



33? 



Protein name 

Description 
GRPE PROTEIN 



Locus Name 



'sp:0REP_RRATI3 



Acc# 



P48204 



s 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
?ST3 — 



Score Probability 
5U3 



l..le-B8 



Protein name 



Locus Name 



integrase 



gp:BFU7537I 



Acc# 



U75371 



Description 



Bacteroides tragills transposon Tn4555 TnpA (tnpAj , integrase (mt) , TnpC 
(tnpC) , excisionase (xis) , mobilization protein (mobA),and beta-lactamase 
(cfxA) genes, complete cds; and unknown genes. 



NT 



AA 



ORF Name 



NTID 



AAID 



lA£££l£i...cl...a££ 1 



WIT 



Length Length 
TTT2 — 



F7T 



Score Probability 
5.3e-^4 



T7T 



Protein name 



Locus Name 



hypothetical protein S110855 



bir:S746S3 



Acc# 
S74833 



Description 



1194 



ORF Name 



1203515 c3 



Protein name 



NT ID 



AAID 



^STE- 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
WO-HIT 



ORF Name 



NTID 



117B.0.D.M...cI...15.5. I 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



1N0-UIT 



ORF Name 



NTID 



AAID 



l 13.7.5.iA50..7...±3,...llfl 1 



mrr 



Protein name 



hypothetical protein 3hp065l 



Description 



NT AA 
* — , * — ^ Score Probability 
Length Length 



Locus Name 



pxr :E71905 



5..9e-10 



Acc# 



E71905 



ORF Name 



Protein name 



NTID 



AAID 



75T5" 



NT 



AA 



Length Length 

52 — n — 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length • J ~ 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NT ID 



Protein name 



K+ transport protein homolog 



Description 



NT 



AA 



„ — . — . Score Probability 
AAID Length Length 





Locus Name 



pir:H70430 



l.Je-53 



Acc# 



H70430 



NT 



AA 



ORF Name 



NT ID 



AAID 



±6ARIL1S...±2..1Q. I 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



X6A33±±Q....&1...15.1 I 



Length Length 
75"" 



Score Probability 



rrnr 



Protein name 

Description 
IHO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



AAID 



NT AA 

— , — _ Score Probability 
Length Length i - 



207 



Protein name 

Description 
[EKPHTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



16.a32.8.a5....al...l6.6.., 



9824 



Length Length 



Score Probability 

im — 



2 . 3e-177 



Protein name 



Locus Name 



hypothetical protein 



pir : JQ1020 



Acc# 
JQ1020 



Description 



1196 



ORF Name 



NT ID 



AAID 



NT AA 
„ — , ^ — . Score Probability 
Length Length JL 



176tiV5 cl lb6 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



iaittiflL„±i„.2a I prew 



Length Length 
1158 I P~^7 



Score Probability 
1.5e-06 



TIT 



Protein name 

Description 
HYPOTHETICAL PftOTEIM MJECSG2 



Locus Name 



sp:YY02_METJA 



Acc# 



Q60301 



NT 



AA 



ORF Name 



NTID 



AAID 



a,a7.Z63.a.7....tl...S. I 14605 



Length Length 
1473 



Score Probability 
— 



6 . Oe-273 



Protein name 



Description 



Locus Name 



sp :CATB_BACFR 



Acc# 



P45737 



CATALASE, 



NT 



AA 



ORF Name 



NTID 



AAID 



9828 



Length Length 

1167 I rrera — 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1197 



ORF Name 



'20101577 tl SO 



Protein name 



hemm permease 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
TUUB — 



JJ4 



Score Probability 
FIT 



3 . Oe-49 



Locus Name 



pir :S54438 



AcC# 
S54438 



ORF Name 



^U0,lUa3.a...Jt.l...l4... 



Protein name 



NT 



AA 



NTID 



AAID Length Length 
9830 



1386 



Score Probability 
1259 



Locus Name 



tryptophan synthase, subunit beta (trpB-i; 
homo log 



prr :G69404 



Description 



Acc# 



G69404 



NT 



AA 



ORF Name 



NTID 



l 2fltaiiSi3....al...2fifi I 



AAID Length Length 



Score Probability 

'204 



2.7e-l6 



Protein name 



Locus Name 



RNA polymerase sigma factor SigZ-liJce protein 



gp:APi^7264 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 3 OS ribosomal protein S16 -HKieprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 



NTID 



AAID 



NT AA 
„ — , — , Score Probability 
Length Length — 



2fla2flI7.7„..t2...2S 



14 610 



TUT 



0.0042 



Protein name 



Locus Name 



branched- chain amino acid. ABC transporter, 
ATP-binding protein (braG-4) homolog 



pir :D69423 



Acc# 
D69423 



Description 



1198 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

mrs — 



Score Probability 
l . ve-94 



B3T 



Protein name 



Description 



Locus Name 



Acc# 



|sp:END4_EC0LI 



ENDOWUCLEASE IV, IV) 



.si! n 1 . 



ORF Name 



20517142 t2 74 



Protein name 

Description 
WO-HXT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



fi 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length r 1 — 



FITS" 



ITT5T" 



Protein name 
Description 

HYPOTHETICAL PROTEIN MJECL4 5 



Locus Name 



sp:YZ3!> METJA 



Acc# 



Q60291 



1199 



NT 



AA 



ORF Name 



NTID 



rl Tl 



AAID Length Length 

mn — 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



amaAii..±i«ii I 



AAID Length Length 
53T3 — 



Score Probability 



in* 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



121&£M.l...a±...±Lti. I 



NTID AAID Length Length 

[5535 — 



Score Probability 



I51T 



PT53" 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



|2Z£6.i:/.13...X2„.6.2 



FISTS 



NTID AAID Length Length 
SMu" 



82 



Score Probability 

prm 



13 . 9e-05 



Protein name 



Locus Name 



bp:BH)S3-767 



Acc# 



U53767 



Description 

Bacillus pumilus plasmid pSH1452, Rep gene, complete cds. 



1200 



ORF Name 



NT ID 



AAID 



NT AA 

— , • — , Score Probability 
Length Length • £ - 



22S£0128 c2 211 



9841 



"ST 



0.031 



Protein name 



Description 



Locus Name 



Acc# 



P36378 



(OSTEONECTIN) (ON) (BASEMENT MEMBRANE PROTEIN BM-40) 



NT 



AA 



ORF Name 



NTID 



AAID 



'2347&176 tl 25 



14620 



Length Length 
TTT 



T73" 



Score Probability 
0.00045 



5? 



Protein name 



Locus Name 



TnpC 



lgp:BFU75:m 



Acc# 



U75371 



Description 



Bacteroides tragi! is transposon Tn4 55 5 TnpA (tnpA) , mtegraselmt) , TnpC 
(tnpC) , excisionase (xis) , mobilization protein (mobA),and beta-lactamase 
(cfxA) genes, complete cds; and unknown genes. 



% as? 

ru 

n 



ORF Name 



NTID 



NT AA 
T — T — ^_ Score Probability 
AAID Length Length ■ — J - 



7TT 



TIT 



ST 



0.0053 



Protein name 



Locus Name 



n YP ot hetical protein BB0404 



lpir:C70150 



Acc# 



C70150 



Description 



ORF Name 



NTID 



AAID 



mamiL±:L.7........ i wzTi 



9844 



Protein name 



putative alpha-glucosictase 



Description 



NT AA 

— — Score Probability 
Length Length 



[21T5Tr 



T2T 



4.5e-29 



Locus Name 



|gp:AAC252161 



Acc# 



AJ252161 



Alicyclobacillus acidocaldarius maltose/maltodextrine transportgene region 
(malEFGR genes, cdaA gene and glcA gene) . 



1201 



ORF Name 



NT ID 



NT AA 

_ ^ — ^, T — Score Probability 
AAID Length Length JL 



24020312 c2 221 



2889 



2.7e-37 



Protein name 



Locus Name 



Acc# 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6 02 7 



JC6027 



Description 



ORF Name 



Protein name 



NTID 



RETT 



AAID 



NT AA 
T — j Score Probability 
Length Length : 



T23~ 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT 



AA 



NTID 



242AU.7.7...±l...a3t 



AAID Length Length 
— 



Score Probability 
— 



Protein name 



Locus Name 



beta -lactamase, A precursor : cephalosporinase 



pir:I40152 



Description 



1.6e~160 



Acc# 



ORF Name 



NTID 



NT AA 

— , - — , Score Probability 
AAID Length Length — ^ 



9848 



73^ 



Protein name 



Locus Name 



0.036 



Acc# 



unknown 



lgp:AF04874s) 



Description 



AF04 8 74 9 



Bacteroides tragilis capsular polysaccharicie biosynthesis operon, complete 
sequence. 



1202 



ORF Name 



N'T ID 



AAID 



NT AA n n , , , . , ^ 
— , — , Score Probability 
Length Length ^ 



24415532 ±3 107 



9845 



1269 



0.00030 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI 06 6 5 



Locus Name 



sp:Y665_HAEIN 



Acc# 
P44033 



NT 



AA 



ORF Name 



NT ID 



AAID 



124548550 ±3 105 



Length Length 
2TT" 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
. "T. , T — . , Score Probability 
Length Length 



25.£&Mil...c2.,..22.2 I ESZS 



Protein name 

Description 
NO -HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 

1215 \ntt% — 



Score Probability 
0.014 



S3" 



Protein name 



Locus Name 



rhoptry protein 



pir :T2B676 



Acc# 



T28676 



Description 



1203 



ORF Name 



'26594683 c2 185 



Protein name 



NT ID 



NT AA „ ^ _ , , n . ^ 
_ __ — , „. — _ Score Probability 
AAID Length Length JL 



2187 



Locus Name 



Acc# 



Description 
NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



TOFT 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT ~ 



ORF Name 



NT 



AA 



NTID 



3MS.m..±2...6.6. „....! 



AAID Length Length 
9855 



Score Probability 
TT3 — ~ 



4.0e-06 



Protein name 



Locus Name 



AJDlEil 



tgp:LLTO6837 



Acc# 



U36837 



Description 



Lactococcus lactis plasmici pNP4 0, abortive inlection locus, AbiEi , AbiEii , 
RecA(LP) , AbiF genes, complete cds . 



ORF Name 



NT AA 

„ m „ ~r — i _ 1 _ — n Score Probability 
NTID AAID Length Length JL 



4634 



5F" 



291 



Protein name 



Locus Name 



Acc# 



Description 
[MO- HIT 



1204 



ORF Name 



32243757 il 24 



Protein name 



Description 



NO- HIT 



NT 



AA 



NT ID 



14535 



AAID Length Length 
ITTu 



Score Probability 



m57 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



14636 



9858 



Length Length 



mo 



Score Probability 
7.6e-24 — 



Locus Name 



|sp:XYLB_BAC0V 



Acc# 



P49943 



m 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



4637 



AAID Length Length 
£T5 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



NT AA „ 

— , — , Score Probability 
AAID Length Length 



isA6.M52.±i...9.b. .1 



TZT 



S.Oes-63, 



Locus Name 



hypothetical protein 2 



pir:I40im 



Acc# 



140233 



Description 



ORF Name 



NTID 



Protein name 



DnaK 



NT 



AA 



AAID Length Length 



Score Probability 
|4.5e-268 



Locus Name 



gp:AB01bB7y 



Description 

Porphyromonas gingivalis dnaK operon genes, complete cas . 



Acc# 



AB015879 



1205 



NT 



AA 



ORF Name 



NT ID 



3954627 c3 225 



AAID Length Length 
— 



OF" 



"SWT 



Score Probability 
255 



|4.ae-26 



Protein name 



Locus Name 



75KFS" 



E 



p;AB015a7<< 



Acc# 



AB015879 



Description 



Porpnyromonas gingxvalxs dnaK operon genes, complete cds . 



p 

s y ■ 

an* 
I fitf 

^ J 



NT 



AA 



ORF Name 



NTID 



I4l47l26 ti 32 



AAID Length Length 
^7 



Score Probability 
533 



I2.ie-51 



Protein name 



Locus Name 



5 1 -nucleotidase 



|gp:CL1131243 



Acc# 



AJ131243 



Description 
ColumJoa livia mRNA tor 5 ' -nucleotidase . 



ORF Name 



NTID 



NT AA 
T — _ — ^ Score Probability 
AAID Length Length J - 



4642 



TFT 



Protein name 

Description 
NO-HIT ; 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



|^2..7.aa...Cl...l6..7.. 



4643 



5T 



TO" 



TFT 



|3.1e-ll 



Protein name 



Na+-ATPase chain J:protem slrl509 :protein 
slrl509 



Locus Name 
[pir:575455 



Acc# 



S75455 



Description 



1206 



NT 



AA 



ORF Name 



NT ID 



AAID 



4727280 12 87 



Length Length 



11191 



Score Probability 
ET7 



Protein name 
Description 

HYPOTHETICAL PROTEIN MJ0878 



Locus Name 



sp:Y878 METJA 



Acc# 



Q58288 



ORF Name 



NTID 



AAID 



NT AA 

— — — ^ Score Probability 
Length Length ^ 



4727337 ri 2^ 



crrr 



l. 4e-l3 



Protein name 



Locus Name 



hypothetical protein PAB1002 



pir :G75064 



Acc# 



G75064 



Description 



ORF Name 



NTID 



AAID 



NT AA 
, — ■ — , Score Probability 
Length Length — J - 



A&M&3.2..±2....5.£ 1 



S7T 



Protein name 

Description 
KfO-HM 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— _ — Score Probability 
Length Length — —t - i- 



5.2..740.D.3....cl...Iia I 



12175 



3.0e-52 



Protein name 



Locus Name 



otnA protein 



pir :S70958 



Acc# 



S70958 



Description 



ORF Name 



S.7.^.45..7....cl...li0.., 



Protein name 

Description 
INO-HTT 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



[2TT 



Locus Name 



Acc# 



1207 



ORF Name 



NTID 



AAID 



NT AA 

— , ■ — , Score Probability 
Length Length • L - 



1984905 cl 16^ 



2TT 



TUT 



6.2e-26 



Protein name 



Locus Name 



Acc# 



conserved Hypothetical protein aq__15 03 



pir :G70430 



G70430 



Description 



ORF Name 



NTID 



NT AA 

^ ^ — . ^ — . Score Probabi lity 
AAID Length Length L - 



0.^1.Xa6.2....t.2L...a3... 



~§WTT 



5T" 



0.0068 



Protein name 



Locus Name 



CrylA toxin receptor A 



gp:AP173J5W 



Acc# 
AF173552 



Description 



Heliothis virescens CrylA toxin receptor A mRNA, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
1...3e-31. 



Protein name 



Locus Name 



putative putrescine/spermidine binding 
protein 



igp : P^EPAHP 



Acc# 



L49465 



Description 



Pseudomonas tluorescens hypothetical metabolite transport protein, positive 
transcriptional regulator (phnR) , phosphonoacetatehydrolase (phnA) , 
2-phosphonopropionate transporter (phnB) , putative putrescine/spermidine 
binding protein, and putativemethionine sulfoxide reductase genes, complete 

cds . j 



NT 



AA 



ORF Name 



NTID 



AAID 



\LMM2&5..±1...2!i I W&oZ 



Length Length 



3210 



Score Probability 




1.8e-48 



Protein name 



Locus Name 



histidme protein kinase homolog GacS 



gp: API 9 7 912 



Acc# 



AF197912 



Description 



Asotobacter vmeiandii histidme protein Kinase nomolog GacS (gacS) gene, 
complete cds. 



1208 



ORF Name 



NTID 



AAID 



NT AA 

— , — ■ Score Probability 
Length Length 



14225553 t3 154 



19875 



75" 



1.6e~ll 



Protein name 



Locus Name 



nypotneticai protein APK2 061 



pir :G72510 



Acc# 



G72510 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



14i8.5.g.S.a...tl...5.3. I FF5^ 



Length Length 
ffTFS 



Score Probability 



|4.5e-16 



Protein name 



Description 



Locus Name 



|sp : TLPA__BRAJA 



ACC# 



P43221 



J>ROTSIN TLL>A) 



NT 



AA 



ORF Name 



NTID 



AAID 



iimftfifi.2..±a„.i&Q I 



Length Length 



PT5T 



Score Probability 




2.0e-27 



Protein name 



Locus Name 



nypotneticai protein MTH671 



pir :D6yi89 



Acc# 



D69189 



Description 



ORF Name 



NTID 



145.3.&0.3.5....a2...2.7.1.. 



Protein name 



NT 



AA 



AAID Length Length 
TTFk 



F3T 



Score Probability 
TTT5 — 



1.5e-134 



Locus Name 



proJDaJDle v-type ATPase, subunit A (atpA-l) 



pir:G71325 



Acc# 



G71325 



Description 



1209 



NT 



AA 



ORF Name 



NT ID 



AAID 



15712655 ri 24 



Length Length 



Score Probability 

\m — 



Protein name 

Description 
HYPOTHETICAL fllOlOS 



Locus Name 



sp:YJJP_HAJ3IN 



Acc# 
P44520 



NT 



AA 



ORF Name 



NT ID 



AAID 



15713042 c2 270 



— ^ n T — ^_ Score Probabi lity 
Length Length J ~ 

355" 



y.ie-05 



Protein name 



Locus Name 



Hypothetical protein BB009b 



pxr :G7011l 



Acc# 



G70111 



Description 



NT 



AA 



ORF Name 



NTID 



16.519.Gl&...c3....3£l I W^V 



AAID Length Length 
FTJT5 



TOST 



Score Probability 




y.0e-40 



Protein name 



Locus Name 



2-keto-3-deoxyg±uconate Kinase 



pir :G72422 



Acc# 



G72422 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



1&6.1M2.:/...±1...3.7.., 



Length Length 
1407 



Score Probability 
753 — ~ 



1.4e-74 



Protein name 



Locus Name 



Na+/H+ antiporter (nnac-l) nomolog 



pir :U7U179 



ACC# 



D70179 



Description 



1210 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length • * L 



1683345b c2 216 



|i.4e-57 



Protein name 



Locus Name 



catxon erllux system protexn 



gp:AF2038Sl 



Acc# 
AF203881 



Description 



Zymomonas mobilis strain ZM4 clone 43F4, complete sequence. 



ORF Name 



1S667750 cl 228 



Protein name 



NT 



AA 



NTID AAID Length Length 

9884 



Score Probability 



7T 



Locus Name 



Acc# 



Description 
NO-HIT 



fMii 



O 



ORF Name 



NTID 



AAID 



NT AA 

— , — / Score Probability 
Length Length — ■ ^ 



ili7.mL±I..lifi I [4^T 



9885 



7F 



Protein name 



Description 



Locus Name 



"sp:&RSfiJlAHSE 



Acc# 



P46S07 



265 PROTEASE REGULATORY SUBUNIT 6B (MEAS E M5 7 3) 



NT 



AA 



ORF Name 



NTID 



2i5.1b.6.7.S....a3....3.3.i I 



AAID Length Length 
2598 



Score Probability 
1621. 



i.5e-16S 



Protein name 



Locus Name 



hypothetical protein PH1512 



pir:D71027 



Acc# 



D71027 



Description 



1211 



NT 



AA 



ORF Name 



NTID 



22445301 t2 87 



AAID Length Length 

m$i — 



or 



Score Probability 
l2.0e-07 



122 



Protein name 



Locus Name 



unknown 



gp:AF125164 



Acc# 



AF125164 



Description 



Bacteroides fragiiis 638R polysaccharide B (PS B2J biosynthesislocus , 
complete sequence; and unknown genes. 



NT 



AA 



ORF Name 



NTID 



AAID 



22454707 t3 161 



WW 



Length Length 



TTJ7T" 



Score Probability 
TJM 



1.7e-l42 



Protein name 
Description 

(RIBONUCLEOTIDE REDUCTASE) 



Locus Name 



sp:RIR2_TOE£>A 



Acc# 



083092 



NT 



AA 



ORF Name 



NTID 



2.3.5.9.5.18.Q...G1...2.6.6.. 



AAID Length Length 
¥T5T" 



9889 



\FITT 



Score Probability 
l.Ve-21 



TOT 



Protein name 



Locus Name 



|sp:YRK0_BA(J£5U 



Acc# 



P54442 



Description 

HYPOTHETICAL 46 .4 KT) PftOTHIN IN BLTR-SMI1IC INTfi&GEMIC REGION 



ORF Name 



NTID 



NT AA 
■ — _ T — ^, Score Probability 
AAID Length Length — *~ 



2.3.6.3.1.6.2.7....G1..,Z1,3. I 14668 



l.2e-102 



Protein name 



Locus Name 



probable V-type ATPase, subunit B tatpB-i) 



|pxr:H7l52B 



Acc# 



H7132 5 



Description 



1212 



NT 



AA 



ORF Name 



NT ID 



124253311 13 167 



AAID Length Length 
~5W$I — 



Score Probability 
i . 9e-15 



2T7 



Protein name 



Description 



Locus Name 



Acc# 



gp:AB016260 



Agrobacterium tumetaciens piasmicl pTi-SAKURA, complete sequence . 



NT 



AA 



ORF Name 



NT ID 



AAID 



WW 



Length Length 



Score Probability 




1.5e-63 



Protein name 



Locus Name 



TonB - dependent receptor HmuR 



|gp:E>GTO7355 



Acc# 



U8 73 95 



Description 



Porphyromonas gingival is TonB- dependent receptor HmuR (hmuR) gene , complete 
cds . ■ 



ORF Name 



NTID 



NT AA 
T — ^ — ^, Score Probability 
AAID Length Length 



1872 



S.le-54 



Protein name 



Locus Name 



V» type ATPase, su&unit I nomolog 



pir:C70111 



Acc# 



C70111 



Description 



ORF Name 



NTID 



|M5.0.3.m...c3....3.5.0... 



Protein name 



2-Jteto-3-deoxygIuconate Kinase 



Description 



NT 



AA 



AAID Length Length 
— 



Score Probability 
3.3e-47 



Locus Name 



pir :G72422 



Acc# 



G72422 



1213 



NT 



AA 



ORF Name 



NT ID 



24B0S567 cl 



AAID Length Length 
— 



Score Probability 

^re — 



Protein name 



Locus Name 



conserved Hypothetical protein MTH12 8 5 



Description 



pir :A69038 



Acc# 



A69038 



NT 



AA 



ORF Name 



, TmTr , ™^ T — — — Score Probabil i ty 
NT ID AAID Length Length JL 

m$z — 



|4.2e-82 



Protein name 



Locus Name 



3 0S ribosomal protein S16-IiJce protein 



gp77£FTT77ST 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 3 OS riJoosomal protein Sl6-lik:eprotein, tucose 
gene cluster, and RNA polymerase sigma factorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



ORF Name 



NTID 



AAID 



Length Length 



AA 

— Score Probability 



Protein name 

Description 
NO-HIT : 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA ^ n . , , , , ^ 
, — _ — , Score Probability 
Length Length 



1.8e-i9 



Protein name 



Locus Name 



conserved .hypothetical protein yvbK 



pir :B70030 



Acc# 



B70030 



Description 



1214 



ORF Name 



NT ID 



126355551 c2 257 



14677 



Protein name 



Hypothetical protexn 



Description 



NT 



AA 



AAID Length Length 
— 



ITT 



Score Probability 
|1.0e-22 



Locus Name 



(pxr:B75629 



Acc# 
B75629 



NT 



AA 



ORF Name 



NT ID 



|2ftll7.a3.Q...Gl-..iiI I ffS75" 



AAID Length Length 
WOT 



T7T" 



1116 



Score Probability 
513 1 |7.6e-2E 



Protein name 



Locus Name 



hypothetical protein 



pir:H7552S 



Acc# 



H75628 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



\l$MMA2...a2...16& I F£7S 



Length Length 



Score Probability 



1W 



Protein name 

Description 
KFTTTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\10A$.1115....alJX12.. 



4680 



9902 



Length Length 



Score Probability 



Protein name 

Description 
WO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

r^i-i, r^f-in Score Probability 
Length Length 



5.U6.±0.±6....al...l&l± I PSST 



mur 



T5W 



I7T5TT 



5.Se-64 



Protein name 



Locus Name 



sp:WCA_bhC£U 



Acc# 



P54585 



Description 

HYPOTHETICAL bti.i KD PROTEIN IN SLM5-CSPB INTERGENIC REGION 



1215 



ORF Name 



NT ID 



NT AA 
_ — ^ T — _ Score Probability 
AAID Length Length -L 



WOT" 



TOT" 



0.021- 



Protein name 



Locus Name 



probable erytiirocyte-JDinding protein MAEBL 



Description 



pir :T09129 



Acc# 



T09129 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



3.Z0AZ^5i3....c2...3.i:/. 



9905 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



1211&1S:L±2...9± 



Protein name 



NTID 



AAID 



4684 



hypothetical protein MTH6 7 0 



Description 



NT 



AA 



Length Length 
33* 



TT7" 



Score Probability 
71 



0.015 



Locus Name 



pTrTCFSTST 



Acc# 



C69189 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



mi£ftfift...ca...m i 



9907 



ITS" 



ffTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



^±A0±bS2....c±...210. I 



AAID Length Length 




TFT 



Score Probability 
3BS 



i.ie-35 



Protein name 



Locus Name 



peptide chain release factor homolog pr±H 



ir;E64748 



Acc# 



E64748 



Description 



1216 



NT 



AA 



ORF Name 



NT ID 



AAID 



9909 



Length Length 

— 



Score Probability 
2 .4e-06 



ITT 



Protein name 



Locus Name 



hypothetical protein 



bp:SSU18S>30 



Acc# 



Y18930 



Description 



Sultolobus soltataricus 281 Kb genomic DNA rragment, strain P2 . 



ORF Name 



NTID 



4688 



Protein name 



cobalamm biosynthesis protexn N 



Description 



NT 



AA 



AAID Length Length 

mw — 



14*57 I 



Score Probability 
|4.4e-125 



687 



Locus Name 



pir :C6H048 



Acc# 
C69048 



NT 



AA 



ORF Name 



NTID 



AAID 



14.0.6.5.9.26....C1...16.5. I 



^TF 



Length Length 
231 



Score Probability 

est 



Protein name 



Locus Name 



hypothetical protein aq_l06 0 



pir :D70391 



Acc# 



D70391 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length J ~ 



i4flai4U5...ci...212 1 WSWG 



isnr 



TTD~ 



i.9e-06 



Protein name 



Locus Name 



Hypothetical protein PHS004 



pir :F71245 



Acc# 



F71245 



Description 



1217 



NT 



AA 



ORF Name 



NTID 



' ^42422^ cl 221 



AAID Length Length 
19913 



Score Probability 
1 . 8e-44 



Protein name 



Locus Name 



spermiame/putrescine ABC transporter; 
permease protein (potC) homolog 



binCi ' /OlVi* 



Acc# 



G70179 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3.4.6A8.5.5..7....C.2....27.9... 



Length Length 
TT7~ 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



1254 



Score Probability 
0.035 



Tf 



Protein name 



Locus Name 



hypothetical protein DKFZp566Di824 . l 



pir :T14767 



ACC# 



T14767 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
9916 



7ST" 



Score Probability 




5.1e-22 



Protein name 



Locus Name 



sp:YJJP_ECOLI 



Acc# 



P39402 



Description 

HYPOTHETICAL 30,5 KD PROTEIN IN DNAT-BGLJ INTERGJilUIC REGION (F277) 



1218 



NT 



AA 



ORF Name 



NTID 



3535211 r2 85 



AAID Length Length 

mn — 



Score Probability 

— 



1.4e-15 



Protein name 



Locus Name 



TonB- dependent receptor HmuR 



|gp:PGUa73 9B 



Acc# 



U87395 



Description 



Porphyromonas gingival Is TonB- dependent receptor HmuR (hmuR) gene, complete 
cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



4055152 cl 241 



Length Length 




Score Probability 
T&2 



Protein name 



Locus Name 



2 - dehydro- 3 - deoxyphosphogluconat e 
aldolase /4- hydroxy- 2 -oxoglutarate aldolase 



pir:F72422 



Acc# 



F72422 



Description 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



pur 



T5T 



Protein name 



Locus Name 



V- type ATPase, subumt E homo log 



foir:H76ili 



Description 



Probability 
6 . ye-11 



Acc# 



H70111 



ORF Name 



NTID 



AAID 



NT AA 

— - , — , Score Probability 
Length Length 



4UB.&$.2....al„.217.., 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



1219 



ORF Name 



489680 ti 43 



Protein name 



NT ID 



NT AA 

— L1 — J , Score Probability 
AAID Length Length • L - 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



A9.0A17£..±2...5.2 



Protein name 



NTID 



AAID 



NT AA 

— _ — ^ Score Probability 
Length Length — ^ 



2 812 



Locus Name 



ACC# 



Description 

( RiBONUCLSOT IDE REDUCTASE) 



Sp:RIRl_TREPA 



083972 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — : ^ 



'A9.5£5.11.±L.±1 1 (T7T5T 



Protein name 



TUB" 



0.012 



Locus Name 



conserved hypothetical protein AF1223 



pir :F69402 



Acc# 



F69402 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— „ — , Score Probability 
Length Length ; — — 



Protein name 



4 7 02 



Locus Name 



Acc# 



Description 



1220 



NT 



AA 



ORF Name 



mm 



T — , — ^ Score Probability 
NT ID AAID Length Length L - 

vsrs 1 wm — ihto 1 irror 



i.be-101 



Protein name 



Locus Name 



spermidine/ putrescme ABC transporter, 
ATP-binding protein (potA) homolog 



pir :A70180 



Acc# 
A70180 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



52&5.m...cl...21i I VTTM 



Length Length 
2W 



Score Probability 
2TE 



2.9e-19 



Protein name 



Locus Name 



provable V- type ATPase, subunit D (atpD-1) 



pir;A71326 



Acc# 



A71326 



Description 



NT 



AA 



ORF Name 



NTID 



l£ftaai&i...ca...a4£ i ftus 



AAID Length Length 
9927 



1135" 



Score Probability 
1009 



l.le-101 



Protein name 



Locus Name 



rtcB protein 



pir:D75521 



ACC# 



D75521 



Description 



NT 



AA 



ORF Name 



NTID 



5ajaa£iX2L„.a^.„Jt3La I 14706 



AAID Length Length 
9928 



Score Probability 
|4.&e-50 



427 



Protein name 



Locus Name 



Acc# 



UDPgiucose- -glycogen giucosyltransterase , , 
skeletal muscle : glycogen (starch) 
synthase : glycogen f starch) synthase 



pir:A333« 



Description 



1221 



NT 



AA 



ORF Name 



NTID 



AAID 



6912585 c2 278 



WTUT 



Length Length 
"TEW 



FET7" 



Score Probability 
E2B 



6 . 3e-40 



Protein name 



Locus Name 



spermidme/putrescine ABC transporter, 
permease protein (potB) homolog 



bir:H7017S 



Acc# 



H70179 



Description 



NT 



AA 



ORF Name 



l(122$.5.1„.a±..:2.±l I 



NTID AAID Length Length 

3*3313 — 



Score Probability 



7T" 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , ^ — , Score Probability 
AAID Length Length — 



m56ti...ai...iia I fto? 



\6.2e-26 



Protein name 



Locus Name 



glycine -rich RNA-bincLing protein (clone A81) 



bir:S31442 



Acc# 



S31443 



Description 



f 3 



NT 



AA 



ORF Name 



NTID 



.7.ami...Gi...a2L£ i ftts 



AAID Length Length 




Score Probability 
0 . 036 



55 



Protein name 



Description 



Locus Name 



Acc# 



U53466 



Cydia pomonella granulosis virus ORF13L gene, partial cas, ORF15L, ORF15R, 
0RF16L, ORF17L genes, complete cds, 0RF17R gene, partialcds . 



NT 



AA 



ORF Name 



NTID 



AAID 



970252 c2 274 



WTTT 



Length Length 
TST" 



Score Probability 




|6-le-15 



Protein name 



Locus Name 



hypothetical protein PH1980 



pxr:D71214 



Acc# 



D71214 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



4 712 



9934 



Length Length 



Score Probability 
7¥3 



3.7e-74 



Protein name 
Description 

HYPOTHETICAL PROTEIN HI 003 5 



Locus Name 



sp:YIDE_HAIi!lM 



Acc# 



P44472 



NT 



AA 



ORF Name 



NTID 



AAID 



llB.8.Zy.2Lb...Xl...Z£ 



ff7TT 



TOTS" 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



119.Z9.D.ia...ta...3.U 



Length Length 
T5— 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1223 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length Jl - 



\±±929$C) ri 6 



9937 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



mmD.^..c3....18A I FTTF 



9938 



Length Length 
1560 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



m 



ORF Name 



NT ID 



AAID 



lim5&ai..±a„.ia 



[¥7TT 



Protein name 



hypothetical protein TM0280 



Description 



NT 



AA 



Length Length 



2208 



Score Probability 
3 . 4e-76 ~ 



SOT 



Locus Name 



pir:F72395 



Acc# 



F72395 



Hi 



ORF Name 



NT ID 



AAID 



Protein name 
Description 

%AS-LWE GTE' -BINDING E>kOTEIN RYL2 



NT 



AA 



Length Length 



Score Probability 
S3 



Locus Name 



0.017 



sp:RYL2_YARLI | 



Acc# 



P41925 



1224 



ORF Name 



13055438 t2 54 



Protein name 



F8~ 



Locus Name 



Acc# 



Description 
IMO-HIT 



ORF Name 



Protein name 



NT ID 



I3..7.aas.s2..±2...a?. i mru 



NT AA 
T — ^, T — . Score Probab ility 
AAID Length Length J - 



TUT 



Locus Name 



Acc# 



Description 
IMO-HIT 



ORF Name 



Protein name 



FucR 



Description 



NT ID 



AAID 



NT AA 

— . — - Score Probability 
Length Length * L - 



vnrr 



1092 



E 



.4e-50 



Locus Name 



gp:AF137263 



Acc# 
AF137263 



Bacteroides tnetaiotaomicron 30S ribosomal protein S16 -HKeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



ia.7.6.3..7....c2.,..13.D..... J \TTT2 



Length Length 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
IMO-HIT 



1225 



NT 



AA 



ORF Name 



NTID 



AAID 



22353385 £2 44 



WTZT 



Length Length 
TFT 



TTF 



Score Probability 
TT3 



6 . oe-28 



Protein name 



Locus Name 



hypothetical protein siroeyy 



pir :S77038 



Acc# 



S77038 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



ama2fl£...ci...iai I wrzz 



Length Length 



1182 



Score Probability 
|S.3e-9i 



Protein name 



Locus Name 



hypothetical protein 



pir :H72299 



Acc# 



H722 99 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , , — , , Score Probability 
Length Length ' L ~ 



^96 



3.1e-41 



Protein name 

Description 
BJslTA- GALAOTOS , { LACTASE ) 



Locus Name 



sp:fi<3AL_THETU 



Acc# 



P26257 



NT 



AA 



ORF Name 



NTID 



21L2S.9.16....a±..±2L. I RT72F 



AAID Length Length 



TT7T 



Score Probability 
TTZ 



6.8e-19 



Protein name 



Locus Name 



unknown 



|gp:AF141M2 



Acc# 



AF141932 



Description 



Rhizobium leguminosarum bv. trifolii plasmid PRlel62Yl0C rspDEFoperon, 
partial sequence. 



1226 



NT 



AA 



ORF Name 



2425943S cl 12b 



NTID AAID Length Length 

9949 



Protein name 



protein kinase, , cGMP - dependent 



Description 



FIB" 



Score Probability 
0.025 



Locus Name 



|pir:B28269 



Acc# 



B28269 



ORF Name 



NTID 



AAID 



NT AA 

— , — Score Pro bability- 
Length Length : 



2Li3.3.5.io.i..±2...i3. I irm 



9950 



T2TT 



Protein name 



Description 



Locus Name 



lsp:LCFH_HAi4IN 



Acc# 



P44446 



ACYL-COA SYNTHETASE) (LAM) 



V3 



SI 



if "1i 

jjii liiij. 



ORF Name 



2AS.26.&.7£...t3....6.3.... 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



WTZT 



Length Length 



Score Probability 



1410 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — 



T5TT 



Locus Name 



Acc# 



Description 
FFTTTT 



ORF Name 



NTID 



AAID 



l 26.5.$.5.0£l..±L.±5. I WT5T 



Protein name 



hypothetical protein MTH14bl 



Description 



NT 



AA 



Length Length 
TTU2 — 



Score Probability 
i.7e-20 



Locus Name 



pir :C69060 



Acc# 



C69060 



1227 



ORF Name 



29462801 tl 2 



Protein name 



NT AA 

— * , — , Score Probability 
NTID AAID Length Length — *• 



WTJT 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



NT AA 

— , — T Score Probability 
AAID Length Length 



TTTTT 



TTT 



8 . Oe-26 



Protein name 



Locus Name 



putative aipna~L-ara£>inoturanosiclase 



gp:ATAC011708 



Acc# 



AC011708 



Description 



AraJDidopsis thaliana chromosome III BAC T7M13 genomic sequence, complete 
sequence . 



ORF Name 



NT AA 

— , — , Score Probability 
NTID AAID Length Length JL 



!&±1&10£..±1...±A I WJT% 



9956 



Ttt 1 ITTTS 



5.3e-07 



Protein name 
Description 

PORIN P PRECURSOR (OUTER MEMBRANE PROTEIN D1J 



Locus Name 



sp:P0RP__P^EAliJ 



Acc# 



P05695 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^~ 



9957 



ST" 



TTF" 



l.Ie-05 



Protein name 



Locus Name 



Styrene sensor Kinase 



gp:PSSTYCATA 



Acc# 



AJ0OO33O 



Description 

Pseuclomonas sp. DNA tor styrene catabolism genes. 



1228 



NT 



AA 



ORF Name 



134406517 c2 127 



NTID AAID Length Length 



1119 1 13360 



Score Probability 



Protein name 



Locus Name 



receptor antigen tRagA) 



(gp:Pail30872 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W5 0 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



34410751 t3 75 



T7T7 



NTID AAID Length Length 
93^3 



Score Probability 
T7Te^T7 



Protein name 



Locus Name 



unknown 



gp:Ai?0073§l 



Acc# 



AF007381 



Description 



Flavobacterium johnsoniae gliding motility protein (glclA) gene, complete 
cds; and unknown genes. 



NT 



AA 



ORF Name 



NTID 



aaft7.2£i..±a...&6 i pi 



AAID Length Length 



Score Probability 
Ii.ie~i27 



Protein name 



Locus Name 



hypothetical protein SCF34.07 



pir :T364Ub 



ACC# 



T36406 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



&tl6ALl...czl..±15. I F71¥ 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



1229 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



'424193 t3 71 



PT7W 



■1.2e-68 



Protein name 



Description 



Locus Name 



Acc# 



sp:RF2_J!C0LI 



£E£T1£)E CHAIN RELEASE FACTOR 2 (RF-2) 



ORF Name 



NTID 



AAID 



14330312 cl 58 



Protein name 



hypothetical protein 



Description 



NT AA ^ n , , , , _ 
T — . , T — Score Probability 
Length Length 



JUT 



Locus Name 



plr :T33724 



Acc# 



T33724 



ORF Name 



Protein name 



NTID 



AAID 



9964 



NT 



AA 



Length Length 
T35~ 



Score Probability 



Locus Name 



Acc# 



Description 
EKFUTT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



14743 



99GB 



Length Length 



1545 



Score Probability 
1.7e-59 



Locus Name 



sp:HBXAJ»R<31 



Acc# 



P49008 



(BETA-NAHASEJ 



1230 



NT 



AA 



ORF Name 



NT ID 



b^7787 c2 131 



AAID Length Length 
— 



1215 



Score Probability 
3.6e-37 



¥011 



Protein name 



Locus Name 



unsaturated glucuronyl hydrolase 



gp:AB019619 



Acc# 



AB019619 



Description 



Bacillus sp. GLl genes tor ort and unsaturated glucuronylhydrolase, 
complete cds . 



NT 



AA 



ORF Name 



SMsSO cl 94 



NTID AAID Length Length 




11425 



Score Probability 
UST 



2.6e-42 



Protein name 



Locus Name 



adenylate cyclase 



gp:D89625 



Acc# 



D89625 



Description 



Anabaena sp. cyaC gene tor adenylate cyclase, complete cds. 



NT 



AA 



ORF Name 



i£ftum...ci..iafi i witz 



NTID AAID Length Length 

mm — 



T7TT 



1112 



Score Probability 




2.7e-22 



Protein name 



Locus Name 



probable succinyl - diamxnopimelate 
desuccinylase 



|pir:H70608 



Acc# 



H706 08 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
F7TT 



Score Probability 



Protem name 

Description 
NO-HIT 



Locus Name 



Acc# 



1231 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1376bVbl t-A lb 



14748 



^57" 



3.3e-07 



Protein name 



cytocnrome jd 



Locus Name 
gp:GPA24939b 



Acc# 



AJ249395 



Description 

(Slobodera pallida mitochondrial Uull, NU4, Uolll , ND6, Mul, frDj anacyto 



genes . 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



— S core Probability 
AAIP Length Length 



Locus Name 



Acc# 



NO-HIT" 



ORF Name 



Protein name 



Description 
INO-HIT 



NT 



AA 



NTID 



AAID 



9972 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



14751 



Length Length 



Score Probability 



ITT" 



Locus Name 



Acc# 



INO-HIT 



1232 



NT 



AA 



ORF Name 



NT ID 



22848775 cl 23 



4752 



— — T — _ Score Probability 
AAID Length Length J - 



W7¥ 



3T" 



10 . 00035 



Protein name 



Locus Name 



|sp:DNU4_M0£U 



Acc# 
P15017 



Description 

PRdfiAfiLfi TRASfSCftlWIOSAL kfiOTLATOfc IN ATPASS CP{0) (URP4) 



NT 



AA 



ORF Name 



NTID 



AAID 



123444088 Cl 24 



Length Length 
£"53 — ; 



Score Probability 



$5 



Protein name 

Description 
P3 r TITT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length 

srrz — 



Score Probability 
75 



Protein name 



Description 



Locus Name 



|gp:MUSIGKBJ 



Acc# 



M13606 



Mouse Ig active kappa-cnam VJ2 mRNA trom HP22.134. 



NT 



AA 



ORF Name 



NTID 



AAID 



\19&5.9£M...±2Jl I F7^ 



wrrr 



Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1233 



NT 



AA 



ORF Name 



NTID 



AAID 



34570927 rl 2 



937S' 



Length Length 
TIT 



Score Probability 



Protein name 

Description 
INO-HI'l 1 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



lfcUm.7....ca...4ft I 



Length Length 
S3 - 



Score Probability 

m 



0.021 



Protein name 



Locus Name 



hypothetical protein C17F3.3 



pir :T32879 



Acc# 
T328.7 9 



Description 



NT 



AA 



ORF Name 



|A162S.lS...±2...1ft I WT5$ 



NTID AAID Length Length 

19530 



fTJT 



Score Probability 

m 



0.021 



Protein name 



Locus Name 



conserved hypothetical protein BBI40 



pir:G70244 



Acc# 



G70244 



Description 



NT 



AA 



ORF Name 



NTID 



l 5£5.9£A0...±2...$. I FTT^r 



AAID Length Length 
55BI — 



11269 



Score Probability 




0.00S5 



Protein name 



Locus Name 



unKnown 



gp:AP035S5§ 



Acc# 



AF033858 



Description 



lectio co ecus pentosaceus strain ATCC43200 plasmid pMD136, compl e teplasmrd 
sequence . 



1234 



ORF Name 



NTID 



NT AA 

— , — . Score Pro babi lity 
AAID Length Length JL 



llVtyJVb c2 42 



IMF" 



|4.1e-25 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PSI130a'72 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen Tragi locus encociinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



l657$3$S c3 46 



AAID Length Length 
WW! 



190 



57T 



Score Probability 
¥uT 



Protein name 



Locus Name 



sp:Y4K._6HISM 



Acc# 
P55617 



Description 

PUTATIVE INSER T ION SEQUENCE A T P -BINDING PROTEIN V4PL 



NT 



AA 



ORF Name 



NTID 



AAID 



m£5...cl...i2 1 



Length Length 
^5 



Score Probability 



267 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



12L5.0.ilS.D...±3....2S. I 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length — — 



Locus Name 



Acc# 



Description 
INO-HIT 



1235 



ORF Name 



34181512 c2 38 



Protein name 



NT ID 



14764 



NT 



AA 



AAID Length Length 




Score Probability 



TUT 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



3.6.4.2.3.;7.6.3.,..Cl...3.4. 



Protein name 



Description 



NTID 



NT AA 

— ■ — , Score Probability 
AAID Length Length JL 



IT 



F5~ 



0.011 



Locus Name 



gp:ATAC011020 



ACC# 



AC011020 



Arabidopsis thaliana chromosome I BAC F12B7 genomic sequence , complete 
sequence . 



ORF Name 



NTID 



£fiaft£fi2L±l..A I WT£% 



9938 



Protein name 



probable sigK protein 



Description 



NT 



AA 



AAID Length Length 
T73 



Score Probability 

T7 — : — 



Locus Name 



pir :F70830 



O.OlS . 



Acc# 



F70830 



ORF Name 



Protein name 



NTID 



[4757 



AAID 



-www 



NT AA 

— , — , Score Probability 
Length Length 



T7T" 



Locus Name 



Acc# 



Description 
IMO-HiT 



1236 



NT 



AA 



ORF Name 



NTID 



55151 c2 41 



AAID Length Length 
WD 1 



9^90 



Score Probability 

— 



7 . 4e-40 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



14753 



9391 



Length Length 



11803 



Score Probability 
553 



l. ye-54 



Protein name 



Description 



Locus Name 



|sp;BgAL_THBW 



Acc# 



P26257 



BETA- GALACTOS IDASE , ( LACITA^E ) 



NT 



AA 



ORF Name 



NTID 



AAID 



HQa3,lR8..7.„.a3....3.a 



9992 



Length Length 
TFZS — 



Score Probability 



374 



TUTT 



6 . 5e-102 



Protein name 



Locus Name 



CDP-glucose-4 , 6 -dehydratase 



pir :D47070 



Acc# 



D47070 



Description 



ORF Name 



NTID 



NT AA 

— , ^ — , Score Probability 
AAID Length Length — 



±£6A0£15....cl...l± 



^771" 



l.de-95 



Protein name 



Locus Name 



CDP-tyvelose epimerase 



Acc# 



U29691 



Description 



Yersinia pseudotuberculosis group 
IVACDP-4-keto-6-deoxy-D-glucose-3-dehydrase (ddhC) gene, partial 
cds, CDP-paratose synthetase (prt) and CDP-tyvelose epimerase (tyv) genes, 
complete cds, and putative O antigen export protein (wzx)gene, partial cds . 



1237 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2031^500 cl 22 



VTTT 



19594 



II . 2e-18 



Protein name 



Locus Name 



Acc# 



clTDP-giucose 4 , 6 - deny drat as e 



pir:H69105 



H69105 



Description 



ORF Name 



NTID 



Protein name 



20.5.0.7.213....c3....2i^ I WTH 



AAID 



NT 



AA 



Length Length 
S3 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



Protein name 



NTID 



WTVT 



AAID 



NT 



AA 



Length Length 
Si- 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



3.31Sil..±l...S 



Protein name 



Description 



INO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



TUT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA ^ n _ ... 
— , — , Score Probability 
Length Length 



A5.17.aib.0....al...2Lfl. I WTTZ 



4.ie-4b 



Protein name 



Locus Name 



glucose- 1 -phosphate cyt lciyiyl trans t erase , 



pir :C47070 



Acc# 



C47070 



Description 



1238 



NT 



AA 



ORF Name 



cl 11 



WTT7 



NTID AAID Length Length 

9999 



Protein name 



115K outer memorane protein precursor : SusC 
protein 



Description 



2538' 



Score Probability 
WT*> 



Locus Name 



pir: JC6027 



U.4e-66 



Acc# 
JC6027 



NT 



AA 



ORF Name 



NT ID 



IM&0.3.5.2....C.3....U I |T77ff 



AAID Length Length 
10000 



Score Probability 
3^5 



8 . oe-41 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



Description 



pir : JC6 02 7 



Acc# 



JC6027 



ORF Name 



10.£3.5.3.3.S...±i...l5.. 1 



Protein name 

Description 
MO-HIT 



NTID 



NT AA 
T — ^, T — ^, Score Probability 
AAID Length Length " L 



10001 



1260 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



NT AA 
„ — , — , Score Probability 
AAID Length Length — - J - 



10002 



TIT 



2.5e»0g 



Locus Name 



gp:D86y34 



Acc# 



D86934 



Description 

Staphylococcus aureus genes, mec region, partial and complete cas . 



1239 



NT 



AA 



ORF Name 



NTID 



AAIP Length Length 



Score Probability 



12258437 c2 330 



10003 



|1.7e-21 



Protein name 



Locus Name 



sp:YYAM_BACSU 



Acc# 



P37511 



Description 

KYPOTHEl'tCAL 32.$ KD i>R0TBIN IN f 12TB-JSX0A INTE&GfiNIC REGION 



ORF Name 



NTID 



NT AA o ^ ^ . , _ , ^ 
T — _ — ^, Score Probability 
AAIP Length Length ^ 



13S65675 cl 304 



110004 



6.2e-97 



Protein name 



Locus Name 



homoserine o-succinyltransterase 



pir :C72324 



ACC# 



C72324 



Description 



ORF Name 



NT AA 

XTmM .^ ^-r^ T — ^ T — , ■ Score Probability 
NTID AAIP Length Length ^ 



m£aft££.,±a...2&a I ftst 



Protein name 



110005 



TTF 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NT AA 

— , — Score Probability 
NTID AAIP Length Length JL 



Protein name 



10005 



Mi 



Locus Name 



Acc# 



Description 
IWO-HM 



1240 



ORF Name 



Protein name 

Description 
[NO-HIT 



NT 



AA 



NTID 



AAID 



10007 



Length Length 




Score Probability 



35 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



14££7.7.D.D....al...3.0.1 



10008 



conserved hypothetical protein 



Description 



NT AA 

— • — , Score Probability 
Length Length ^ 



1929 



155 



Locus Name 



tpir:E75439 



2 .3e-10 



Acc# 



E75439 



W 

I is? 
(ill 

0 

If J 

!!!;:: 

fl 
,[([ jS3, 



ORF Name 



|1S.7.0J.2.1S..±3.,..26.6.. < 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



10009 



Length Length 



Score Probability 



rsir 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
r — , — ^ Score Probab ility 
Length Length — — *- 



|li^^..±1...4.1 I 



10010 



7¥" 



6 . 3e-iv 



Protein name 

Description 
FERRED0X1K1 



Locus Name 



sp : FER_BUTME 



Acc# 



P14073 



1241 



ORF Name 



N'T ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ^ 



16615912 r3 261 



110011 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length J ~ 



FT7W 



10012 



431 



TTZT 



2-.3e-177 



Protein name 



Locus Name 



hypothetical protein 



pir : jgi020 



Acc# 



JQ1020 



Description 



ORF Name 



NTID 



NT AA 

- — , — , Score Probability 
AAID Length Length 



13.6.B.y.aZ5....al...3.3A.. 



FT75T 



10013 



Protein name 



hypothetical protein F10M10.30 



Description 



I-5e-39 



Locus Name 



pir :T04772 



Acc# 



T04772 



ORF Name 



NTID 



ia.7.inaai..±i...aa I wrwi 



10014 



NT 



AA 



AAID Length Length 
551 



Score Probability 
75 



0.012 



Protein name 
Description 

Toxocara cams TcH SLdT.46 0 mRNA, complete eels . 



Locus Name 



bp:TCUS4725 



Acc# 



U64729 



1242 



nprr N^m^ NT ID AAID 


NT AA 
Length Length 


Score 


Probability 


19770(JB6_c2_351 4793 10015 


1170 


652 


7.le-64 


Protein name 


Locus Name 


Acc# 


potassium- dependent ATJ^ase suounit JJ ■ 


| gp:At'!ilJ4bb 


AF213466 


Description 




" Anabaena sp . 1.-31 Kap operon, complete 


sequence . 








off N^mP. NTID AAID 


NT AA 
Length Length 


Score 


Probability 


2OTT3633_±i_lc> 4794 10016 


191 576 


144 


" 4.1e-20 



Protein name 



Description 



Locus Name 



sp:£NUCjbiALTY 



Acc# 



P24520 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



10017 



Length Length 
TSF 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



4796 



AAID 



10018 



NT AA 
Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



22305532 t2 123 



14757 



AAID Length Length 
10014 



ST0~ 



Score Probability 
STS 



6 . 0e-S7 



Protein name 



hypothetical protein 



Locus Name 
|pir;g76075 



Acc# 



S76076 



Description 



ORF Name 



NTID 



NT AA 
„ — , — , Score Probability 
AAID Length Length — JL 



FT7W 



10020' 



ST" 



ST - 



0.031 



Protein name 



Description 



Locus Name 



sp:SPRC_XENLA 



Acc# 



P36378 



(OSTEONECTIN) (ON) (BASEMENT MEMBRANE PROTEIN BM-40J 



NT 



AA 



ORF Name 



NTID 



2Z&6.a&.3.6....C.3....43.:L 



AAID Length Length 
10021 



|¥TET 



Score Probability 
|4.0e-86 



Protein name 



Locus Name 



probable phosphonopyruvate decarboxylase, T" 



pir:D55154 



Acc# 



D69154 



Description 



ORF Name 



NTID 



AAID 



NT AA 
T — , T — , Score Probability 
Length Length — 



21&26.1±±...al..A0A 



10022 



TOT" 



Protein name 



Locus Name 



potassium- transporting ATPase, B subunlt 



pir:A7BS27 



Description 



6 . 4e-22l 



Acc# 



A75627 



1244 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



'23632«75 c3-403 



110023 



Score Probability 
|6.be-125 



1228 



Protein name 



Locus Name 



potassium- translocating ATPase A chain 



|gp;AAC243194 



Acc# 



AJ243194 



Description 



Alicyciobacillus acidocaldarius JcctpA gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



23960013 c3 41$ 



10024 



Length Length 



Score Probability 
2.4e-ilS 



53? 



Protein name 



Locus Name 



putative secreted protein 



gp:SCF41 



Acc# 



AL117387 



Description 

Streptomyces coeiicolor cosmict F41. 



NT 



AA 



ORF Name 



NTID 



AAID 



i&m:i:±Li...a±Ji$:± i 



10025 



Length Length 



— Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— _ — , Score Probability 
Length Length J - 



14504 



10026 



TTT 



1002 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1245 



ORF Name 



NTID 



NT AA 

_ TT ^ _ — T — _ Score Probability 
AAID Length Length 



24322712 cl 295 



[¥5775" 



10027 



1152 I [IT^ 



T5T" 



5 . 3e-132 



Protein name 



Locus Name 



115K outer membrane protein precursor .* SusC 
protein 



pir : JC6 02 7 



Acc# 



JC602 7 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



10028 



Length Length 



Score Probability 



Protein name 

Description 
IKIO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



M6A0.3.15...±2...10.6 1 PT7 



10029 



Protein name 



hypothetical protein rjnpl2ll 



Description 



NT 



AA 



Length Length 



Score Probability 
P^T" — 



Locus Name 



3 , Oe-26 



ACC# 



C71832 



ORF Name 



NTID 



AAID 



\2±6A1&±&..±1..±&.1 1 



10050 



Protein name 
Description 

S.scxurx mecAl gene, strain K3 UVUVI2 ) 



NT AA 

— , , — , Score Probability 
Length Length — — 



55" 



Locus Name 



gp:SSK3MECAl 



3.£e-05 



Acc# 



Y13052 



1246 



# 



ORF Name 



24S04691 cl 2<>5 



Protein name 

Description 
tWO-MIT " 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



110031 



T7T" 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



10032 



F7T" 



Locus Name 



aspartate .kinase, / nomoserme dehydrogenase, 
T16H5 . 70 rprotein T16H5 . 70 :protein T16H5,70 



pxr :T047b2 



Description 



i.2e-58 



Acc# 



T04752 



NT 



AA 



ORF Name 



NTID 



4811 



AAID Length Length 
10033 



Score Probability 

Tn — 



1..5e-39 



Protein name 



Locus Name 



VicK protein 



|gp:EPA01i>0b0 



Acc# 



AJ012 050 



Description 



Enterococcus faecalis vie operon and rianJcing genes. 



ORF Name 



NTID 



AAID 



NT AA 
t t Score Probability 

Length Length 



|2Sm_.ai..All I 



110034 



7W 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1247 



# 



NT 



AA 



ORF Name 



'i> Si 7712 cl 333 



NTID 
4813 



AAID Length Length 
10035 



ISTT5" 



Score Probability 
TZ7 



1.8e-i2 



Protein name 



Locus Name 



Hypothetical protein S110687 



p±r:S74416 



Acc# 



S74416 



Description 



ORF Name 



NTID 



NT AA 

^ ^ x — _ — ^. Score P robabi lity 
AAID Length Length * L 



2b.&3.£M2....c2...3.8.5. I 



110036 



T7HT" 



T3T" 



7.6e-06 



Protein name 

Description 
FECR PROTEIN 



Locus Name 



sp : FECR^ECOLI 



Acc# 



P23485 



NT 



AA 



ORF Name 



2S.M5.&8.1...C3....417. J prers- 



NTID AAID Length Length 

10037 



RTUT" 



Score Probability 
WJ — 



6 . Oe-99 



Protein name 



Description 



Locus Name 



sp : AAT_BACST 



ACC# 



Q59228 



ASPARTATE AMINOTRANSFERASE, (TRANSAMINASE A) (AS PAT) 



NT 



AA 



ORF Name 



NTID 



ia.?.2Lflft2Li...ci...3ii I Fsrs 



AAID Length Length 
10035 



5TT 



Score Probability 
7F5 



2 . 8e-78 



Protein name 
Description 

PROBABLE ASPARTOKINASE, (ASPARTATE KINASE) 



Locus Name 



sp:AK METJA 



Acc# 



Q57991 



1248 



• 



NT 



AA 



ORF Name 



3001402 t3 222 



NT ID 




AAID Length Length 
10035 



Score Probability 
ITT5 



1.9e-06 



Protein name 



Description 



Locus Name 



|gp .-50740155 



Acc# 



U40158 



Staphylococcus carnosus response regulator-like protein ( ortxj gene , partial 
cds . 



NT 



AA 



ORF Name 



30656255 cl 254 



NTID AAID Length Length 
10040 



Score Probability 
3TB 



2 . 9e~28 



Protein name 



Locus Name 



RNA polymerase sigma t actor SigZ-like protein 



|gp:AFl37263 



Acc# 



AF137263 



Description 



Bacteroides tnetaiotaomicron 30S ribosomal protein ,sl6-likeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds. 



NT 



AA 



ORF Name 



3,U6.6.16.8.2...t2....183..... I 14819 



NTID AAID Length Length 

10041 



TSZ 1 [T¥ST 



Score Probability 




8 . 2e-35 



Protein name 



putative aspartate kinase 



Locus Name 



|gp:ATAC0107S7 



Acc# 



AC010797 



Description 



Arabidopsis thaliana chromosome III BAC F28J7 genomic sequence , complete 
sequence. 



ORF Name 



NTID 



AAID 



NT AA 

— , — . Score Probability 
Length Length ■ 



\10120A0±..c2..3A$. I 



10042 



1UT 



8 . Oe-33 



Protein name 

Description 
C CHAIN) 



Locus Name 



sp:ATK.CJ_MYCTU 



Acc# 
P96369 



NT 



AA 



ORF Name 



NTID 



3125011 r3 223 



4821 



AAID Length Length 
10043 



811 



Score Probability 
ll.le-29 



Protein name 



Locus Name 



|gp:AP12620i 



Acc# 



AF126201 



Description 



Pseuctomonas putida strain S-313 sultate ester desulf urization geneiocus, 
complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID 



3132040 12 120 



10044 



Length Length 



Score Probability 
6 . 3e-72 



Protein name 

Description 
(L-ASMAaE I) 



Locus Name 



sp:ASGiJSC0LI 



Acc# 



P18840 



NT 



AA 



ORF Name 



NTID 



3.2.18.8....J12...LS.&. 



AAID Length Length 
10045 



2511 



Score Probability 

wnr — 



1 . le-97 



Protein name 



Locus Name 



NADH oxidase (noxA- 3 ) homo log 



pirTTT^W' 



Acc# 



H69299 



Description 



NT 



AA 



ORF Name 



NTID 



S.3.4S.g.£0.2....a3....40.2. I W&I% 



AAID Length Length 
10046 



TUTT 



Score Probability 
0.00068 



114 



Protein name 



Locus Name 



transmembrane sensor 



gp:AP0516'91 



Acc# 



AF051691 



Description 



Pseuclomonas aeruginosa stress tactor A (pstA) , ecf Sigma tactor (tiuij , 
transmembrane sensor (fiuR) , and hydroxamate- typef errisiderophore receptor 
fiuA) genes, complete cds . 



1250 



NT 



AA 



ORF Name 



'3B24b63i> c3 410 



NTID AAIP Length Length 

10047 



FIT 



Score Probability 
2 .4e-09 



Protein name 



Locus Name 



unKnown 



gp:U96771 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase , B-l , 4-endogIucanase , and. 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID 



36375662 cl 323 



10048 



Length Length 
TTZZ — 



T5T 



Score Probability 
333 



2 . 8e~94 



Protein name 



Description 



Locus Name 



sp:HBK2_HAl*IH 



Acc# 



P44503 



THRKOUINB SYN T HASE, 



NT 



AA 



ORF Name 



NTID 



3.9.12.6.6.3....£2.„.12.2L.. 



AAID Length Length 
10049 



TJWT 



Score Probability 
T2T2 — : 



2 . le-126 



Protein name 



Locus Name 



sp:RADA_BACSU 



Acc# 



P37572 



Description 

DMA REPAIR PROTEIN RaDA HOMOLOfl (r>MA kttPhlk PROTEIKf SMS H0M0L0G) 



NT 



AA 



ORF Name 



NTID 



AAID 



3.a3.9.3.&7....a3„..i3..7.. 



10050 



Length Length 



Score Probability 
3.6e-53 



Protein name 



Locus Name 



putative 30.6 KDa protexn 



gp:AF037440 



Acc# 



AF037440 



Description 



Eclwardsiella ictaluri D-3 -phosphoglycerate denyclrogenase tserA) gene, 
partial cds; ribose- 5 -phosphate isomerase (rpiA) , inhibitorof chromosome 
initiation (iciA) , putative 26 kDa protein (yggE) , putative 30.6 kDa protein 
(y^gB) , and fructose 1 , 6-bisphosphatealdolase (fda) genes, complete cds; and 
phosphocrlycerate kinase (perk) gene, partial cds. m 



1251 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2942202 c2 229 



E 



WIT 



10051 



TT7TT 



Protein name 

Description 
ARYLStfLFATASE P E>ftSCflftSOft, (ASP) 



Locus Name 



sprAR^MlUMAN 



Acc# 
P54793 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



'4078255 Cl 281 



10052 



2 .4e-47 



Protein name 



Locus Name 



tripeptidyl ammopeptidase 



Igp : STMTPAP 



Acc# 



L46588 



Description 



Streptomyces lividans tripeptidyl ammopeptidase gene, completecds . 



ORF Name 



NTID 



NT AA 

— , — • „ Score Probability 
AAID Length Length 



10053 



1974 



Protein name 



Locus Name 



Acc# 



Description 
INC -HIT 



ORF Name 



NT 



AA 



NTID 



Al±l&B£....z±..M.9. J K&n 



AAID Length Length 

— 



10054 



Score Probability 
859 



Protein name 



Locus Name 



response regulatory protein (rrp-2) homolog 



pir :B70195 



Description 



Acc# 



B70195 



1252 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



4554087 c2 3^9 



TS1T" 



10055 



HT5" 



T2TT" 



2.5e-123 



Protein name 



Locus Name 



GTP cyclohyarolase II, / 3 , "~ 
4-dihydroxy-2-butanone 4 -phosphate synthase, 
rihA:ribA protein 



pir:C70331 



Acc# 



C70331 



Description 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



4SS1507 c3 439 



10055 



2880 



WIT 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



Description 



toiriJtibO'A 1 / 



Acc# 



JG6027 



ORF Name 



NTID 



AAID 



— — Score Prob ability 
Length Length 



10057 



TT7F 



3.1e-13b 



Protein name 
Description 

PUTATIVE PRO T EAN ¥ DC1> ^kUcjUkriuk, 



Locus Name 



Acc# 



sp:YDCl>JillloLl 



NT 



AA 



ORF Name 



NTID 



i5m&iiL.ti...a I 



AAID Length Length 
I1005& 



[7TTT 



Score Probability 
H2T" 



|3.5e.-08 



Protein name 



Locus Name 



heme receptor 



|gp:VIBHUTjT 



Acc# 



L27149 



Description 

Vibrio cholerae heme receptor (hutA; gene, complete cas , 



1253 



ORF Name 



NTID 



5272312 C2 350 



Protein name 



110059 



hypothetical protein Rv0587 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



pir:P70907 



o.oorar 



Acc# 



F70907 



ORF Name 



NTID 



AAID 



NT AA n „ 
— ^ — L1 Score Probability 
Length Length 



5.27.M2l5..±2...13.4 1 [^TF 



10060 



FT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



5.2.B.Q.9.12...X3....2.2.X I 14839 



Length Length 
EST 



Score Probability 



fTTF" 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



110062 



Length Length 

rnrr 



Score Probability 



Protein name 

Description 
INO-HIT " 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1254 



ORF Name 



NTID 



AAID 



636780B ±3 272 



4842 



10064 



Protein name 



outer membrane protein 21, Omp2i 



Description 



— — Score Probability 
Length Length • 



639 



Locus Name 



0.010 



Acc# 
AJ001918 



Comamonas aciaovorans omp2l gene. 



ORF Name 



NTID 



AAID 



^ — Score Probabi lity 
Length Length • 



64171^2 c3 42U 



Protein name 



TO" 



10065 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



13.7.£.7.1&D...±1...Z3... 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



60 



TFT" 



7TT 



Locus Name 



U70TT 



Acc# 



hypothetical protein APE1598 



Description 



lpir;A72b3* 



A72539 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



10067 



Locus Name 



0.042 



Acc# 



hypothetical protein URFbv 



Description 



bir:T30436 



T30436 



1255 



NT 



AA 



ORF Name 



NTID 



16603402 c2 7& 



AAID Length Length 
10068 



JTT 



TUTT 



Score Probability 
I . ue-94 



Protein name 



Locus Name 



WbnF 



gp:AF172324 



ACC# 



AF172324 



Description 



Escherichia coll GalF (galF) gene, partial cds; o-antigen repeatunxt 
transporter Wzx (wzx) , WbnA (wbnA), O-antigen polymerase Wzy(wzy) , WbnB 
(wbnB), WbnC (wbnC) , WbnD (wbnD) , WbnE (wbnE) , UDP-Glc-4 -epimerase GalE 
(galE) , 6-phosphogluconate dehydrogenaseGnd (gnd) , UDP-Glc- 6 -dehydrogenase 
Ugd (ucrd) , and WbnF (wbnF) genes , complete cds; and chain length determinant 



NT 



AA 



ORF Name 



NTID 



AAID 



l21£.7.aS.0.0....cl....6.4 J 



10065 



Length Length 
FT7T" 



Score Probability 
B.iSe-132 



TT5T 



Protein name 



Locus Name 



3~isopropylmalate dehydratase, large chain 



pir :T2y083 



Description 



Acc# 



T29083 



NT 



AA 



ORF Name 



NTID 



AAID 



m£Z±£2i...c2...±4 j mrs 



10070 



Length Length 



Score Probability 
$.7e-44 



Protein name 



Description 



Locus Name 



sp:LEUD_HAEIN 



ACC# 



P44438 



(ISOPRO^yLM&lAT B I^OMKRASIE) (ALPHA- IPM l^OMUkA^lil) 



NT 



AA 



ORF Name 



NTID 



AAID 



\2X6MA25....Ql...hh J pil 



10071 



Length Length 
T57~ 



i7T7¥~ 



Score Probability 
TWIE 



2.4e-189 



Protein name 

Description 
(IMDH) (3-IPM-DH) 



Locus Name 



spiLWUJJiAilJ'R 



Acc# 



P54354 



1256 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



24270450 Cl 67 



TUUTT 



ia.Oe-14 



Protein name 



Locus Name 



unknown 



gp:AF036677 



Acc# 



AF036677 



Description 



Salmonella typhimurium putative operon regulated by PmrAB, necessary tor 
4-aminoarabinose lipid A modification and polymyxinresi stance, PmrG (pmrG) 
gene, partial cds; PmrF (pmrF) gene and Gorfs, complete cds; and PmrD (pmrD) 
gene, partial cds. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



T5T 



Score Probability 
ITUS 



S.le-OS 



Protein name 



Locus Name 



hypothetical protein PH0219 



pir:A71245 



Acc# 



A71245 



Description 



NT 



AA 



ORF Name 



NTID 



24:U.7..7.S.5....cl...£a.... I 15552" 



AAID Length Length 
10074 



1032 I 13096 



Score Probability 
FT7 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusG 
protein 



pir : JC6027 



Description 



2.6e-S6 



Acc# 



JC6027 



ORF Name 



NTID 



AAID 



NT AA 

— , „ - — , Score Probability 
Length Length 



\2&.21b.&..±ZA2 1 



10075 



Protein name 



Locus Name 



Acc# 



Description 

MO-HIT 



1257 



ORF Name 



NT ID 



.*, TT . — , — Score Probability 
AAID Length Length 



34081405 c2 71 



H0076 



sir 



2FT 



TTTT 



I . ye-06 



Protein name 



Locus Name 



nypotneticax protein PHS004 



|pir:P7i245 



Acc# 



F71245 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3A4.0..7.a3..7....G2....:U 



10077 



Length Length 



Score Probability 
T27T7 — 



l.ie-122 



Protein name 



Locus Name 



sp:LEUl_HAEIN 



Acc# 



P43861 



Description 

SVMTHAaM) (ALPHA- TPM SYNTHETAyii!) 



ORF Name 



NTID 



NT AA . ^ , . 
~ __ T — _ T — _ Score Probability 
AAID Length Length : aL 



10078 



2.3e-65 



Protein name 



Locus Name 



2-isopropylmaIate synthase (IeuA-1) nomolog 



pir;E69369 



Acc# 



E69369 



Description 



ORF Name 



NTID 



NT AA 
, — , — , Score Probability 
AAID Length Length 



5.3.3.2l5D.£..±I...2.4. I 



110079 



7T 



0.034 



Protein name 



Locus Name 



nypotneticax protein PH0220 



pir :B71245 



Acc# 



B71245 



Description 



ORF Name 



Protein name 



NTID 



10080 



lipid A di saccharide synthase 



Description 



NT 



AA 



AAID Length Length 



^7" 



Score Probability 
2.0e-lfl 



Locus Name 



pir:B72014 



Acc# 



B72014 



1258 



ORF Name 



NT ID 



NT AA _ _ , , . _ . . 
— — Score Probability 
AAID Length Length 



5501877 c2 76 



10051 



1553" 



Protein name 



Locus Name 



do 1 i c no i - pno spna t e manno syitransterase 



Description 



fpirTTTTMFT 



|1.4e-33 



Acc# 



G70463 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



10082 



86 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



±0±6A6XL.alJl&$. 



10083 



Protein name 



prolme-rich protein precursor 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
1.7e-l2 



T7TT 



Locus Name 



pir :S23737 



ACC# 



S23737 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length / Length 

wz§ — 



Score Probability 



10084 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



m£^o.2...c&...2£8. I 



1008S 



Length Length 
^TT5 



TUT 



Score Probability 
1.4e-65 



TTT 



Protein name 



Locus Name 



arabmogalactan-iiKe protein 



pir :Sb^994 



Acc# 



S52994 



Description 



NT 



AA 



ORF Name 



NTID 



12500153 t'2 42 



AAID Length Length 
1008S 



ITS" 



Score Probability 
TTA 



4 .4e-07 



Protein name 



Locus Name 



Hypothetical protein Rv3 864 



pir:E70556 



Acc# 



E70656 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
10057 



12054" 



Score Probability 
1 |3.2e-40 



Protein name 



Locus Name 



receptor antigen (RagA) 



bp:Paii308'n- 



Acc# 
AJ130872 



Description 



Porphyromonas gingivaiis W50 receptor antigen trag) locus encociinga ma]or 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — 



liiduO.Mo.„±i.„iis.... I mzz 



110085 



JIT 



Protein name 

Description 
pKFTTTT — 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT 
Length 



AA 
Length 



— Score Probability 



10085 



TTT 



Protein name 

Description 
WO-HIT 



Locus Name 



Acc# 



1260 



NT 



AA 



ORF Name 



NTID 



TOFT 



AAID Length Length 
10030 



Score Probability 



11236 



Protein name 

Description 
BTO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



I45a3.m...a^..m i 



AAID Length Length 
10031 



Score Probability 
27U 



4.be-22 



Protein name 



Description 



Locus Name 



Acc# 



|sp;TRC4 EOTLI 



DMA &R1MASB TRAC, (REPLICATION PRIMASE) 



'ii !!::;■ 

in 



ORF Name 



NTID 



AAID 



NT AA 

— ■ , — , Score Probability 
Length Length — jL ~ 



4870 



10032 



TT3™ 



B.Se-OS 



Protein name 

Description 
DEHYDRIN RAM 8 



Locus Name 



sp:DHl&_ARATH 



Acc# 



P30185 



ORF Name 



NTID 



NT AA 

— , — - Score Probability 
AAID Length Length JL 



|15.0.3.£6.3.2...al...l6.2... 



[4871 



10033 



TuT 



0.036 



Protein name 



Locus Name 



exodeoxyribonuclease V, gamma chain (recC) 
homolog 



pir :A70179 



Acc# 



A70179 



Description 



1261 



NT 



AA 



ORF Name 



NTID 



AAID 



I202S0 ±3 137 



10094 



Length Length 
1 ITT7T5 — 



Score Probability 
W9 



0 .041 



Protein name 



Locus Name 



unknown 



|gp:U96771 



Acc# 



U96771 



Description 



Prevoteiia Joryantu putative polygalacturonase, B-l, 4- endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



;Ji{ if* 



NT 



AA 



ORF Name 



NTID 



20506501 c2 202 



4873 



AAID Length Length 



10095 



Score Probability 
735 



l .ie-1.9 



Protein name 



Locus Name 



hypothetical protein JD1488 



pir:«45>02 



Acc# 



C64902 



Description 



ORF Name 



NTID 



NT AA 

_ ^ _ — _ T — _ Score Probability 
AAID Length Length — J - 



za5.yASAi...t3....nz. 



10096 



TO" 



Protein name 

Description 
[K^OTT — 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
, — ^ T — ^, Score Probability 
Length Length — i - 



10097 



"3UTT 



TT5~ 



1.5e-08 



Protein name 



Locus Name 



hypothetical protein SC6G4.36C SC6G4.36C 



pir :T35587 



Acc# 



T35587 



Description 



1262 



NT 



AA 



ORF Name 



21507338 11 11 



NTID 
p75 



AAID Length Length 
10053 



Score 



2211 



Probability 
|2.3e~89 



Protein name 



Description 



Locus Name 



gp:BFU630% 



Acc# 



U63096 



Bacteroictes tragi lis (bctAJ gene, complete cds , 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



'21576375 cl 157 



ITOS?" 



T7T 



KIT 



T5T 



Probability 
3.2e-12 



Protein name 



Description 



Locus Name 



gp:AF083424 



Acc# 



AF083424 



Atelme herpesvirus 3 complete genome. 



NT 



AA 



ORF Name 



NTID 



i 2l2l54&1SI..±2....5;L 



AAID Length Length 
10100 



TTT 



Score Probability 
5.2e-06 



Protein name 



Locus Name 



Hypothetical protein T15B7 . 3 



pir :T322bO 



Acc# 



T32250 



Description 



NT 



AA 



ORF Name 



NTID 



2.22a6.5.a6....a3....3.aa., 



AAID Length Length 
±0161 



2649 



Score Probability 
5.4e-23 



Protein name 



Locus Name 



Acc# 



mobilization protein c 



|gp:AF11824i 



AF118243 



Description 

fiacteroides tragilis mobilization protein C (mooo gene, compieteccLs . 



1263 



NT 



AA 



ORF Name 



NT ID 



23486336' cl 146 



AAID Length Length 
10102 



7T 



Score Probability 
1 10.0098 



63 



Protein name 



Locus Name 



R07K5.1 protein (clone ROVES) 



pir :S4iie>04 



Acc# 



S43604 



Description 



NT 



AA 



ORF Name 



23,saaLafi...ai...u4 i 



NTID AAID Length Length 

10103 



Protein name 



Hypothetical protein PH0217 



Description 



TuTT 



TOT" 



Score Probability 

m — 



i.Se-06 



Locus Name 



pir :GV1244 



ACC# 



G71244 



NT 



AA 



ORF Name 



NTID 



I48S2 



AAID Length Length 
10104 



Score Probability 
14$ 



1 . 4e-10 



Protein name 



Locus Name 



spiYMAJ&ACSU 



Acc# 



P50838 



Description 

HYPOTHETICAL 21.1 KD PROTEIN IN COTD-KDUD IN'l'ERSEHIC REGION 



ORF Name 



NTID 



AAID 



NT AA 

— , — . ■■ Score Probability 
Length Length v ^~ 



2126S£.2.1..±1..12 1 P^T 



10105 



TTT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA „ n , , . _ , ^ 
— L1 — J _. Score Probability 
Length Length — JL 



10106 



Protein name 

Description 
IWO-HIT 



Locus Name 



Acc# 



1264 



ORF Name 



24489062 £3 120 



Protein name 



NTID 



14885 



AAID 



110107 



NT AA 

• — , — , Score Probability 
Length Length J ~ 



286 



Locus Name 



Acc# 



Description 
INO-HTT — 



ORF Name 



Protein name 



NTID 



AAID 



10108 



NT 



AA 



Length Length 
TFT 



Score Probability 



536 



Locus Name 



Acc# 



Description 
INO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 
_ — _ — _ Score Probability 
AAID Length Length JL 



10109 



T5T 



Locus Name 



Acc# 



Description 

NO-HIT 



ORF Name 



NTID 



Protein name 



AAID 



10110 



NT AA 

— — Score Probability 
Length Length — - u - 



57T 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



2LasaacLi3.„.ai.„ia2 1 



Protein name 



AAID 



10111 



NT AA 
v — — — L1 Score Probability 
Length Length — J - 



7WF 



Locus Name 



Acc# 



Description 
MO-HIT 



1265 



NT 



AA 



ORF Name 



NT ID 



AAID 



'25627153 i:3 143 



14590 



10112 



Protein name 

Description 
NO-HIT 



— ^, , — , Score Probability 

Length Length ■ ^ 

75 1 [2*22 

Locus Name Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2Ka.7.aai2..±i...io i itost 



10113 



Length Length 



T5TT 



Score Probability 
ET5 



0 . 0044 



Protein name 



Locus Name 



antigen 5401 



pir :A60643 



Acc# 



A60643 



Description 



ORF Name 



NTID 



2&&&22La3..„t3L...ia4 1 14 8 92 



Protein name 



chromosome partitioning ATPase So] 



Description 



NT 



AA 



AAID Length Length 
10114 



TTW 



Score Probability 
T75 



2 . Oe-05 



Locus Name 



pir:D75570 



Acc# 



D75570 



NT 



AA 



ORF Name 



NTID 



AAID 



2.&.7.52Ll&2...t2L...&Q.. 



Protein name 

Description 
BTO-HIT 



10115 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
[NO-HIT 



NT 



AA 



NTID 



AAID 



10116 



Length Length 
355 1 IH5W 



Score Probability 



Locus Name 



Acc# 



1266 



ORF Name 



29354192 c2 253 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 
10117 



Score Probability 
0.027 



7^ 



Locus Name 



|gp:S83135 



Acc# 



S83195 



ORF Name 



3136418 ci idi 



Protein name 



NTID 



NT AA 
~_ _ — ^. — . Score Probabilit y 
AAID Length Length • L 



II Oil 8 



Locus Name 



sperm mitochondrial capsule selenoprotein 



Description 



pir :A37199 



I2.3e-0S 



Acc# 



A37199 



NT 



AA 



ORF Name 



NTID 



3.1*^.2<3. 9>li><>t.l>>*>l>-7> in ii 



AAID Length Length 
10119 



Score Probability 
TTE 



|7.5e-06 



Protein name 



Locus Name 



major ampullate tibroin protein 



pir : A36068 



Acc# 



A36068 



Description 



NT 



AA 



ORF Name 



NTID 



3.1^3.2.aD..7....CZ...^3A | 14898 



AAID Length Length 
10120 



TUT" 



TUT 



Score Probability 
|3.8e-06 



TT2 



Protein name 



Locus Name 



KIAA0775 protein 



|gp:AB018318 



Acc# 



AB018318 



Description 

Homo sapiens mRNA tor KIAAU775 protein, complete cds , 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3IS36bbI cl lbb 



10121 



ITT 



3 . ie-27 



Protein name 



Locus Name 



| gp:CB(jl]JPAi4 



Acc# 



Y10436 



Description 

G. burnetii put. genes tor encod ing glucose inhibited divisionprotem a ana 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


333S8568_cl_i4b 


4$00 


10122 


78 


237 






Protein name 








LOCUS 


Name 


Acc# 


Description 
















NO-HIT 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 

Length 


Score 


Probability 


3.3..?.5.2.21b...±2...6.!/. 


4$oi 


10123 


457 


1374 






Protein name 








Locus 


Name 


Acc# 


Description 














MO-HIT I 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


lllXSMX^tl^l 


4902 


10124 


240 


723 


111 


"0 . 00096 



Protein name 



Locus Name 



troponin t 



bir:S027od 



Acc# 



S02708 



Description 



1268 



NT 



AA 



ORF Name 



NT ID 



AAID 



34131583 ±3 115 



10125 



Length Length 



Score Probability 



[5W 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



MA0.7.5.7.5...±1...20...„ I mtt 



10126 



Length Length 

?m — 



Score Probability 



Protein name 

Description 
[MO-HIT 



Locus Name 



Acc# 



«;i sii? 

5*1 :ss; ! 
fjj 

If f«l 



NT 



AA 



ORF Name 



NTID 



AAID 



110127 



Length Length 



Score Probability 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



I; -J 



NT 



AA 



ORF Name 



NTID 



AAID 



3.im41fL.c.2L...m I F^TO 



110128 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



INO-HTT 



NT 



AA 



ORF Name 



NTID 



AAID 



10125 



Length Length 
TT3 



Score Probability 



TTu" 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



1269 



ORF Name 



NT ID 



NT AA „ ^ , , , n . ^ 
• — , — Ll Score Probability 
AAID Length Length JL 



34652032 t3 107 



10130 



f¥irr 



1224 



3 ,0e-08 



Protein name 



Locus Name 



Acc# 



hypothetical 119. 5K protein luvrA region) :ORF 
1 protein 



pir : jgo405 



JQ0405 



Description 



ORF Name 



NT ID 



NT AA n n , _ _ , ^ 
— ^ t — l1 Score Probability 
AAID Length Length aL 



3.5.23.9A5.a...ci...l7.a.. 



10131 



1W 



3 . 5e-06 



Protein name 



Locus Name 



latent nuclear antigen 



lgp:AFOa3S01 



Acc# 



AF0835 01 



Description 



Macaca mulatta rnaamovirus 17 57 7 Rl, dihydrotolate reductase, complement " 
binding protein, ssDNA binding protein, transportprotein, glycoprotein B, 
DNA polymerase, R2 , thymidylate synthase ,R3, Bcl-2 homolog, capsid protein, 
tegument protein, thymidinekinase , glycoprotein H, major capsid protein, 
capsid protein, kinase, alkaline exonuclease, glycoprotein M, _ 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



&52£2S&l...c2...2$$ J [4^TO 



10132 



TZTT 



FT 



0.0033 



Protein name 



Locus Name 



beta-D-galactosidase 



gp:BRPLACZ01 



Acc# 



M63097 



Description 

Brugia malayi £>eta-D~galactosidase {lacZ) mRNA, partial cds . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



U£M2MX..eX./±a& ] ff9TT 



10133 



T7T 



y . le-12 



Protein name 



Locus Name 



MOCB <Tn4399) 



pir :B48487 



ACC# 



B48487 



Description 



1270 



NT 



AA 



ORF Name 



361b0i>7V cl 164 



NT ID AAID Length Length 

10134 



Score Probability 



T5T 



Protein name 

Description 
[NO-HIT '• 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



10135 



Length Length 



Score Probability 
0.014 



53 



Protein name 



Locus Name 



enveiope protein 



gp : HTVENVME 



Acc# 



M61052 



Description 



Human T-cell leukemia virus I (HTLV1 ) envelope (env) gene, 5' end. 



ORF Name 



NTID 



[3.&£uM43...±3...lil. I K?I? 



Protein name 



hypothetical protein sirii35 



Description 



NT 



AA 



AAID Length Length 
10136 



Score Probability 
T75 



Locus Name 



pir :S77439 



1 . 7e-31 



Acc# 



S77439 



ORF Name 



Protein name 

Description 
RKFITTT 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



10137 



TTT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
„ — . — , Score Probability 
Length Length ^ 



1013a 



T5TT 



'S.ie-15 



Locus Name 



DNA repair protein Raac 



BTrT 



C70439 



Acc# 



C70439 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



'4027135 13 HO 



WWTT 



10133 



Length Length 
TTT 



Score Probability 



11014 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



5&U&ii£...cA...2L£& I I^TF 



10140 



TTZT 



I.2e-SS 



Protein name 



Locus Name 



transposase 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn552 0 transposase tbxpH) anctmobilization 
protein BmpH (bmpH) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



£$.3.7.£2..±1...1£ J 



10141 



Length Length 
TUT 



TI5" 



Score Probability 
6.7e-05 



ST 



Protein name 



Locus Name 



hypothetical protein 



pir:B4050El 



Acc# 



B40505 



Description 



NT 



AA 



ORF Name 



NTID 



8.13.M2..±3..„13.2... 



14920 



AAID Length Length 
10142 



TTT" 



Score Probability 
0.0034 



37 



Protein name 



Locus Name 



putative resolvase 



gp : DAS OR 



Description 



Acc# 



DesulturoioiDUS ambivalens tnpA, tnpB, rtbD and sor genes and ORF2,ORF3, 
0RF4 and ORF5 . 



1272 



ORF Name 



12944067 ±2 4 



Protein name 

Description 
INO-HIT 



NT 



AA 



NT ID 



AAID 



H53T" 



10143 



Length Length 



Score Probability 



T0775~ 



Locus Name 



Acc# 



ORF Name 



|4QaBtia2..±l.„I.... 



Protein name 

Description 
INO-HIT 



NT 



AA 



NT ID 



AAID 



10144 



Length Length 
JUT 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT ~ 



NT 



AA 



NTID 



AAID 



10145 



Length Length 

174 



Score Probability 



J7T 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



lifta...ci...ia i wtzz 



(10146 



11278 



'1208 



a.6e-123 



Protein name 
Description 

(GLYCINE AOETYLTRANSPlikASE) 



Locus Name 



sp:KBL_ECOLl 



Acc# 



P07912 



1273 



ORF Name 



\2bl^WA tl 7 



Protein name 

Description 
MO-HIT 



NT 



AA 



NT ID 



AAID 



10147 



— _ „. — ^ Score Probability 

Length Length 

1 [7TS 

Locus Name Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



AAID 



i4ittitLiti...t2„.a I 



10148 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



5.1&M&2l..g2....27.... 



Protein name 



NTID 



AAID 



10149 



hypothetical protein PH077.8 



Description 



NT 



AA 



T T — . , Score Probability 
Length Length 



TT 



0.018 



Locus Name 



pir :D7ll26 



Acc# 



D71126 



ORF Name 



NTID 



E217.3iI.t2J. I 



Protein name 



NT 



AA 



AAID Length Length 
10150 



JUT 



Score Probability 
TM ■ 



Locus Name 



gp:D42067 



0.0078 



ACC# 



D42067 



Description 

Porphyromonas gingivalis DNA tor Fimbrilm, ORFl-4, complete cds . 



1274 



NT 



AA 



ORF Name 



NT ID 



AAID 



10151 



Length Length 



T5T 



Score Probability 
TJD 



l.be-08 



Protein name 



Locus Name 



gp:VCH231106 



Acc# 



AJ231106 



Description 
Vibrio cholerae z47t gene. 



ORF Name 



NTID 



AAID 



7S5235 ±2 11 



10152 



Protein name 



hypothetical protein F08F3.4 



Description 



NT 



AA 



Length Length 
TIT 



275" 



Score Probability 




Locus Name 



pir :T29433 



3.Se~55 



Acc# 



T29433 



NT 



AA 



ORF Name 



NTID 



10..7.5.7.3.3.3...±2...49.., 



AAID Length Length 
T77T5 



10153 



Score Probability 
TIT7 — 



3.4e-07 



Protein name 



Locus Name 



unknown 



|gp:U9£771 



Acc# 



U96771 



Description 



Prevotella Joryantii putative polygalacturonase, B-l, 4 -endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



NTID 



AAID 



NT AA 
r ~*.u T — h Score Probability 
Length Length 



iaaiftaft5Lci...iCL& I mzi 



10154 



1060 I \TTW5 



i.8e-5? 



Protein name 



Locus Name 



beta-N-Acetylglucosammidase 



gp:AB0U8771 



Acc# 



AB 008 771 



Description 



streptomyces . thermoviolaceus nagA gene rorbeta-N-Acetylgiucosammidase, 
complete cds . 



1275 



NT 



AA 



ORF Name 



NTID 



114657^7 fi 70 



AAID Length Length 
TIB 



10155 



Score Probability 




II. 9e-54 



Protein name 



Locus Name 



'sp:Y796_MHTJA 



Acc# 



Q58206 



Description 

HYPOTHETICAL ABC TRAWSPO&TEk ATP -BINDING P&0TE1N MJ0756 



ORF Name 



NTID 



NT AA 
T — _ T — _ Score Probability 
AAID Length Length ^ 



17010202 t2 40 



10155 



ll.be-17 



Protein name 



Locus Name 



conserved nypotnetical protein MTH.695 



ir:F69192 



Acc# 



F69192 



Description 



NT 



AA 



ORF Name 



NTID 



13.9.5. 3.U5u..£.3....r/.3. I 14935 



AAID Length Length 
785 



10157 



TFT" 



Score Probability 
TUL 



|1.8e-17 



Protein name 



Locus Name 



RNA polymerase sigma factor SigZ-like protein 



gp:AF1372M 



Acc# 



AF137263 



Description 



Bacteroicles tnetaiotaomicron 3 OS ribosomal protein si6-HJceprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



TWIG- 



AAID Length Length 
10158 



1007 



3024 



Score Probability 
5.9e-58 



Protein name 



Locus Name 



beta-N-Acetylglucosaminidase 



gp:AB008771 



ACC# 



AB008771 



Description 



streptomyces thermoviolaceus nagA gene torbeta-N-Acetylglucosamimctase, 
complete cds . 



NT 



AA 



ORF Name 



NT ID 



'212757 cl 80 



AAID Length Length 
110159 



EES" 



Score Probability 
2T7 



« .ye-24 



Protein name 

Description 
SNA MISMM ' cJH REJ^Alk EEtoTHIKf MUTS 



Locus Name 



sp:MU i rajrHjyAQ 



Acc# 



Q56215 



ORF Name 



NT ID 



AAID 



NT AA Score 
Length Length 



2l6Sl502 t3 74 



10160 



Probability 
|3.2e-14 



Protein name 



Locus Name 



transmembrane sensor 



|gp:AF051^1 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A (psrA) , ECF sigma factor itiul) , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore ; receptor 
(fiuA) genes, complete cds . 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



l 2M7.50.5.3...±1...2ft I W5T$ 



10151 



3258 



Protein name 



Locus Name 



receptor antigen (RagA) 



bp:PGI130-8'72 



Acc# 



AJ130872 



Description 



Porphyromonas gingivaiis W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 



Score Probability 
5T2 — " 



|4.Se-4S. 



Protein name 



Locus Name 



glucose Kinase 



bp:BM<3LUCKIK 



Acc# 



AJ000005 



Description 
bacillus megacerium giK gene. 



1277 



NT 



AA 



ORF Name 



24551537 £2 41 



NTID AAID Length Length 

10162 



TTUT 



Score Probability 
p .4e-16 



Protein name 



Locus Name 



hypothetical protein sirl207 



bxr:S7754i 



Acc# 



S77541 



Description 



NT 



AA 



ORF Name 



2S.!ib.:/.aiS....t2...4.2.. 



4942 



NTID AAID Length Length 

10164 



FTSTT 



Score Probability 

v&i — 



|6.2e-65 



Protein name 



Locus Name 



immunoreactive 51kD antigen PG52 



gp:AF1757iy 



ACC# 



AF175719 



Description 



Porphyromonas gingival is strain W5 0 immunoreactive 51KD antigenPG52 gene, 
complete cds . 



ORF Name 



NTID 



AAID 



NT AA 
T — . , T — . , Score Probability 
Length Length 



2L6.6.aa23.2L...tl...i:/. 



4943 



110165 



TIT 



l.^e-26 



Protein name 

Description 
SOS RIBOSOMAL PROTEIN LI 9 



Locus Name 



sp:RLiy_STRTR 



Acc# 



034031 



NT 



AA 



ORF Name 



i£.7.flli.7...±l...i& I [4^T4 



NTID AAID Length Length 

110166 



TTT 



Score Probability 
6.627 



Protein name 



Locus Name 



hypothetical protein BB0794 



pir :A70199 



Acc# 



A70199 



Description 



1278 



NT 



AA 



ORF Name 



NTID 



26750178 cl 104 



4 945 



AAID Length Length 
10167 



Score Probability 




5.1e-15 



Protein name 



Locus Name 



udp- sugar Hydrolase 



pir :A72201 



Acc# 



A72201 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



1±6.11$.2..±2...B± I [iM^ 



10158 



Length Length 
721" 



2175 



Score Probability 
— 



|8.ie-lii 



Protein name 



Locus Name 



melxbiase 



gp ; TEMELA 



Acc# 
Y08557 



Description 

T. ethanolicus melA and lacA genes. 



ORF Name 



NTID 



NT AA 
T — T — ' ■ Score Probability 
AAID Length Length ^ 



isa7.ftm„.c2...m i fmt 



10165 



1029 I 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



ORF Name 



Protein name 

Description 
MO -HIT 



NTID 



AAID 



4948 



10170 



NT 



AA 



Length Length 
1 



Score Probability 



202 



Locus Name 



Acc# 



1279 



NT 



AA 



ORF Name 



NT ID 



AAID 



14305138 11 22 



10171 



Length Length 



Score Probability 




9. Je-14 



Protein name 



Locus Name 



alpha -xylosiaase 



pir :A72394 



Acc# 



A72394 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
10172 



TTTT 



Score Probability 
TUS 



|4.2e-27 



Protein name 



Locus Name 



conserved hypothetical protein yknZ 



pir;E65858 



Acc# 



E69858 



Description 



ORF Name 



NT ID 



S.0£6.ZA2..±X..±$..: I 14951 



10173 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
226 



Locus Name 



p.ir:S76546 



B.0e-i8 



Acc# 



S76946 



ORF Name 



Protein name 



NT ID 



NT AA 
— . — _ T — Score Probability 
AAID Length Length — ■ jL 



10174 



putative alpha-glucosidase 



Description 



TST" 



1.8e-05 



Locus Name 



gp:AAC2521bl 



Acc# 



AJ252161 



Alicyclo£>acillus acidocaidarius maitose/maltodextrine transportgene region 
(malEFGR genes, cdaA gene and glcA gene) . 



1280 



NT 



AA 



ORF Name 



NT ID 



5339381 £3 75 



4953 



— — Score Probability 
AAID Length Length — JL 

10175 



33T 



157 



|3.1e-10 



Protein name 



Locus Name 



putative alpha-glucosxdase 



gp:AAC252161 



Acc# 



AJ252161 



Description 

Alxcyclobacillus acxdocaidarius maltose/maltodextrine transportgene regxon 
(malEFGR genes, cdaA gene and glcA gene) . 



NT 



AA 



ORF Name 



NTID 



§22127 cl 105 



AAID Length Length 



10116 



Score Probability 
|S.le-2S 



32$ 



Protein name 



Locus Name 



|sp:5MTD_I)IS0M 



Acc# 



P29240 



Description 

S 1 -NUCLEOTIDASE y'MICUkSOR, (ECTO- NUCLEOTIDASE) 



W; PR 
p) |!;B, 



NT 



AA 



ORF Name 



NTID 



ltt3.^5.3.Z.7....a2....2. 



14955 



AAID Length Length 
10177 



TIT" 



Score Probability 

ese — 



|1.2e-21 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pxr : JC6027 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— _ — ^ Score Probability 
Length Length 



a.3.aaai5a.»a3L...Aa [ 14956 



10178 



T5T" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1281 



NT 



AA 



ORF Name 



NT ID 



AAID 



16589717 c2 2& 



10179 



Length Length 
JUT 



Score Probability 
TS7 



2 . ^e-09 



Protein name 



Description 



Locus Name 



gp : MMS AG 



ACC# 



X84710 



M.mazei surtace antigen genes or±492, or£375 and or£783. 



ORF Name 



NTID 



I240241&2 tJ id 



AAID 



10180 



NT AA 

— , — - Score Probability 
Length Length • £ ~ 



JIT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



240.&&Ml...c3....4£ I 



10161 



Length Length 



Score Probability 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



4 96 0 



AAID 



110182 



t «™+->, ro^t-v, Score Probability 
Length Length 



ITTTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1282 



ORF Name 



NT ID 



AAID 



NT AA 

— , , — , Score Probability 
Length Length jL - 



125370887 c3 4K 



Protein name 



10163 



T7T 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— Ll , — , Score Probability 
Length Length JL 



|2ft£II£lL.cA...44 



Protein name 



10184 



Locus Name 



0.0075 



Acc# 



hypothetical protein BB0212 



Description 



pir:D7012fo 



D70126 



ORF Name 



NTID 



AAID 



NT AA 
T ~<-u T ^r~. , Score Probability 
Length Length 



\io.o±S££...±i..±2 ...i f^t 



10185 



TuT 



0.024 



Protein name 



Locus Name 



Acc# 



probable chxtinase 



pxr :T42071 



Description 



T42071 



ORF Name 



NTID 



3.2lM0.$.5.3....c1...2£ I [4^4 



Protein name 



AAID 



10186 



NT 



AA 



Length Length 



Score Probability 



T55" 



Locus Name 



Acc# 



Description 
IWO-HIT " 



ORF Name 



NTID 



AAID 



15.141.±3....li 



10187 



Protein name 



otnA protein 



Description 



NT AA 
^ — . — Score Proba bility 
Length Length -E - 



T87F" 



T7T" 



i.6e-44 



Locus Name 



Ipir:&i70958 



Acc# 



S70958 



1283 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NT ID 



AAID 



4366 



10188 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT ID 



AAID 



hypothetical protein siribis 



Description 



NT 



AA 



„ — L1 _ — . Score Probability 
Length Length — 



Locus Name 



|p2r:S75464 



2.4e-0B 



Acc# 



S75464 



NT 



AA 



ORF Name 



X21253iL„C2...3.5. 



NTID AAID Length Length 

10190 



Score Probability 
TUB 



7.9e-17 



Protein name 



Locus Name 



unknown 



gp:U96771 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase, B-l, 4 -endoglucanase, and 
mannanase genes, complete cds ; and unknowngenes . 



ORF Name 



NTID 



AAID 



NT AA 
T — — Score Probability 
Length Length — -L 



inmm..£aji& .....i mzv 



10191 



TTTT 



9.7e-S0." 



Protein name 



Locus Name 



carboxyl- terminal proteinase 



pir : F70369 



Acc# 



F70369 



Description 



1284 



NT 



AA 



ORF Name 



NT ID 



AAID 



14492660 c2 



E 



T7JT 



10192 



Length Length 
7T~~ 



Score Probability 



\ZTT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT"" 



NT 



AA 



ORF Name 



NTID 



T — ' 1 — ^, Score Probability 
AAID Length Length z - 

10193 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



115A85.5.2...C.2...9.0. I 



AAID Length Length 
10194 



77T 



2319 



Score Probability 
3902 



TOT 



Protein name 



Locus Name 



JDeta-glucosiciase 



gp:AP00S658 



Acc# 



AF006658 



Description 



Bacteroides fragilis beta-glucosidase gene, complete cds , 



NT 



AA 



ORF Name 



24&D.6.S..7.6....C2...S.S..... I 15573 



, „™ T — L1 „ — ^ Score Probability 
NTID AAID Length Length JL 

110195 



TUT 



7.9e-05 



Protein name 



Locus Name 



unknown 



|gp:AF124349 



Acc# 



AF124349 



Description 

Zymomonas mobxlxs ZM4 tosmict clone 41A4, complete sequence. 



1285 



NT 



AA 



ORF Name 



NTID 



AAID 



126355883 c3 101 



10156 



Length Length 

P^i5 — i irrai — 



Score Probability 
3T5 



M-5e-39 



Protein name 

Description 
HYPOTHETICAL STOAk KINASE tiLRObi'V 



Locus Name 



sp:YM37_SYMY3 



Acc# 
Q55480 



NT 



AA 



ORF Name 



NTID 



AAID 



133242062 c3 100 



10157 



Length Length 
£2T 



12475 



Score Probability 
lti.Se-82 



Protein name 



Locus Name 



hypotneticai protein TM0280 



pir:f72335 



Acc# 



F72395 



Description 



NT 



AA 



ORF Name 



NTID 



3.3.&6.m&...aL...7.8... 



14376 



AAID Length Length 
10138 



loos I [jttth 



Score Probability 
\$T2 



1.3e-32 



Protein name 



Locus Name 



receptor antigen (RagAj 



gp:PGI130872 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen (rag) locus encodinga ma] or 
immunodominant 55kDa antigen . 



NT 



AA 



ORF Name 



NTID 



|4dl3.3..7..7....ci..^..7.. 



AAID Length Length 



110133 



TT3T" 



Score Probability 
5.4e-U6 



Protein name 



Locus Name 



sp:XYMB PRERU 



Acc# 



P48791 



Description 

1, 4-BETA-XYLOSIDASE) (EXO-BETA- (1,4) -XYLANASE} 



1286 



ORF Name 



'44>6iSVa c2 £6 



Protein name 



NT ID 



NT AA „ 
, , „ ^ — i ^ — A _. Score Probability 
AAID Length Length JL 



10200 



[IMF- 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



aa^5.i2....az...a6. 



Protein name 



NTID 



497$ 



AAID 



10201 



NT 



AA 



Length Length 
3T7~ 



Score Probability 



Locus Name 



Acc# 



Description 
ETO^HTT 



ORF Name 



15.7A7M0...±3....1 



Protein name 



NTID 



4980 



AAID 



110202 



NT AA 
T — ^ t — x.t_ Score 
Length Length 



ITTT 



Locus Name 



Probability 



Acc# 



Description 
MO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
T — , v, T — . , Score Probability 
Length Length 



4981 



10203 



TIT 



S.0e-18 



Locus Name 



lsp;YNWbM^OLI 



Acc# 



P77522 



Description 

HYPOTHETICAL 55 . 3 KB PROTEIN IN LPP-AROD I NTERG E NIC REG I ON 



1287 



NT 



AA 



ORF Name 



'21^1^41 c3 4 



NT ID AAID Length Length 

10204 



7T 



Score Probability 
ETJ 



2.1e-lS 



Protein name 



Locus Name 



probable oxidorecluctase 



pxr :T34993 



Acc# 



T34993 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



iiaMi6.a..±3....i 



10205 



Length Length 
W 



Score Probability 
TT7 



2 . 2e-28 



Protein name 



Locus Name 



4 -methyl -5 (b- hydroxy ethyl) -thiazole 
monophosphate biosynthesis protein (thiJ) 
homo log : 



pir:D70177 



Acc# 



D70177 



Description 



ORF Name 



NTID 



NT AA 
T — _ _ — Score Probability 
AAID Length Length JL - 



21^y^6.6.1...c2...2 ...I HTSW 



10205 



F£uT" 



1 . 3e-14 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



ACC# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



\±o.9AiSAb....ti...i I mzs 



10207 



Protein name 

Description 
NO-HIT 



NT 



AA 



Length Length 
TO - 



Score Probability 



[2TT 



Locus Name 



Acc# 



1288 



ORF Name 



113707933 tl 1 



Protein name 



omp85 analog 



Description 



NTID 



AAID 



10208 



NT 



AA 



Length Length 



Score Probability 
TD5 



Locus Name 



pir :D7^J094 



0.0014 



Acc# 



D72094 



ORF Name 



Protein name 

Description 
[NO-HIT 



NT 



AA 



NTID 



AAID Length Length 
110203 



Score Probability 



59^ 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NTID 



NT AA 

._ _ _ _ — _ T — _ Score Probability 
AAID Length Length ~ JL 



10210 



Locus Name 



Acc# 



ORF Name 



2.£7.a6.o.6.:/....t3....a 



Protein name 



NT 



AA 



NTID 



AAID Length Length 
10211 



PIT 



TOT" 



Score Probability 

TM — : 



0.00010 



Locus Name 



Acc# 



|sp:YJDBJ!C6LI 



Description 

HYPOTHETICAL 61.7 KD M0TE1N IN BA3S-ADIY INTERSENIC REGION 



1289 



NT 



AA 



ORF Name 
132031466 ii 3 



NT ID AAID Length Length 

10212 



TXT 



Score Probability 
0,00025 



Protein name 



Locus Name 



sp:VBi^_ECOLI 



Acc# 
P75785 



Description 

HYPOTHETICAL 5S.7 Kt> £>R0Tfil3tf IN OM&X-MOfiB INTSfttiEMIC REGION 



NT 



AA 



ORF Name 



10635006 tl 1 



— — Score Probability 
NT ID AAID Length Length JL 

| [10213 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



NT 



AA 



ORF Name 



NT ID AAID Length Length 

urn — i mi — 



Score Probability 



1 110214 



Protein name 
Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NT ID 



23.9.9.L6.6.2....JL3 — 1 .... 



4993 



AAID Length Length 
7TU~ 



10215 



Score Probability 




2.2e-53 



Protein name 



Locus Name 



Acc# 



mobilization protein A 



|gp:AV118241 



AF118241 



Description 

Bacteroicles rragilis mobilization protein A (mobA) gene, compietecds . 



12 90 



NT 



AA 



ORF Name 



NTID 



1^41^5 c2 



14994 



AAID Length Length 
10215 



Score Probability 



TUT" 



Protein name 



Description 



Locus Name 



Acc# 



NO -HIT 



NT 



AA 



ORF Name 



NTID AAID Length Length 

10217 



\TT& 1 I57T 



Score Probability 
51 



0.047' 



Protein name 



Locus Name 



polymorpmc outer membrane protein G tamily 



bp;AB033V<J4 



Acc# 



AB033794 



Description 



Chlamyctophila pneumoniae pmp_3 . 1 gene tor polymorphic outermembrane protein 
G family, complete cds . 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length J ~ 



1021$ 



186 



Protein name 

Description 
[NO-HIT 



Locus Name 



ACC# 



NT 



AA 



ORF Name 



NTID 



mfis.7.fcL.±a...a I 



AAID Length Length 
1488 



Score Probability 



10219 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



1291 



NT 



AA 



ORF Name 



NT ID 



AAID 



I24641S77 c3 33 



110220 



Length Length 

tts — I rriB 1 



Score Probability 



Protein name 

Description 
ITO^HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



a5sami„.c3L...4Q I izss? 



10221 



Length Length 
333 1 [TFFS — 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



In 



to . 

f i :i 



NT 



AA 



ORF Name 



NT ID 



AAID 



i411D.M.0....c.:i...3.7. I FTTJTT 



10222 



Length Length 



Tim- 



Score Probability 
TM 



0.032 



Protein name 



Locus Name 



hypotnetical protein Y26D4A.9 



pir:T2&569 



Acc# 



T26569 



Description 



13 



NT 



AA 



ORF Name 



NT ID AAID Length Length 

5001 1 110223 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID AAID Length Length 
mU2 1 110224 1 [3T7 1 [TUTS — 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



1292 



ORF Name 



|2636'87S7 c2 29 



Protein name 



Description 



BTO-HIT 



NT 



AA 



NT ID 



AAID 



Length Length 

or 



Score Probability 



Locus Name 



Acc# 



ORF Name 



\lll&L5!!L.dlJ±l.. 



Protein name 



Description 



NO-HIT 



NT 



AA 



NTID 



AAID 



SHUT* 



10226 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



10227 



Length Length 
2TT7 



708 



Score Probability 
525 



|3.3e-178 



Locus Name 



conserved hypothetical protein ydcl 



|pir:<^9773 



Acc# 



G69773 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ■ 



10228 



TUT 



324 



73" 



Protein name 



Locus Name 



Acc# 



E3 class 2 protein 



pir :B46308 



B46308 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



£2L&5.8.17....al..3„7. I 



10225 



fZTT 



Protein name 



Locus Name 



Acc# 



Description 
INO-HTT — 



1293 



ORF Name 



NTID 



i4aa^76a £3 25 



110230 



Protein name 



conserved nypothetical protexn yisg 



Description 



NT 



AA 



AAID Length Length 



Score Probability 




Locus Name 



|pir:H69837 



Acc# 



H69837 



NT 



AA 



ORF Name 



15.M20.6.2..±3....2!i. I |FTT^ 



NTID AAID Length Length 

110231 



77T" 



Score Probability 

Tn — 



2.4e-17 



Protein name 



Description 



Locus Name 



|gp:SPU5S23£ 



Acc# 



U59236 



Synecnococcus PCC7242 rxbosomai protein si ot 30S riJDOsome (rpsi) , ORF271, 
ORF231, ORF341, carboxyl transferase alpha subunit (accA) , ORF245 , ORF227, and 
GTP cyclohydrolase I (folE) genes, completecds, and ORF205 gene, partial 
cds . 



ORF Name 



NTID 



NT AA 

— ■ , — , Score Probability 
AAID Length Length 



OTTO" 



10232 



TT8 



|2.4e-17 



Protein name 



Locus Name 



Acc# 



sp:YESR_EeOLl 



Description 

HYPOTHETICAL 20,3 Kb PROTEIN IN PRC-.L>PHA IN T EROENIC REGION 



NT 



ORF Name 



NTID 



AAID 



5011 



10233 



Length Length 
2UT 



AA 

— , Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1294 



NT 



AA 



ORF Name 



NTID 



AAID 



— , — , Score Probability 
Length Length ^ 

w& — 



T5T 



Protein name 

Description 
ro^TTTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



2M.7.3.S.2.:/...±1...I 



AAID Length Length 
^33 



10235 



TFTT 



Score Probability 
T^S 



l . ye-14 



Protein name 



Locus Name 



two component sensor 



gp:AF030352 



Acc# 



AF030352 



Description 



Pseudomonas aeruginosa two component sensor (lemA) gene, partialcds. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



414 



Protein name 



Locus Name 



conserved hypothetical protein 



pir :G72220 



Acc# 
G72220 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — . . Score Probability 
Length Length 



3.9.£.7.:L&.2....£.3....2Z.. 



10237 



TOT 



1752 



FTTe^TT 



Protein name 



Locus Name 



Acc# 



2 ' ,3 1 -cyclic-nucleotide 2 1 -phosphodiesterase, 
precursor 



bir:H64532 



H64532 



Description 



ORF Name 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 
TT9 



Score Probability 



TIT- 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



5T5TT 



110235 



761T 



12301 



7TTe=Tl 



Locus Name 



sprCIRAJilL'oLl 



Acc# 



P17315 



C0LIC1W I ftECEPTOfe PRECURSOR 



ORF Name 



Protein name 



NT ID 



AAID 



110240 



NT AA „ „ , , . , . . 

— — Score Prob ability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10241 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



JWO-HIT 



1296 



NT 



AA 



ORF Name 



NTID 



25355015 c2 21 



5020"' 



AAID Length Length 
T5T0 — 



10242 



Score Probability 
7.4e-74 



440 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



Description 



Acc# 



JC602 7 



ORF Name 



NTID 



440.a27.a...c3....21 J I5TJ2T 



Protein name 



AAID 



10243 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



C3 

Hat 
i! PS 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10244 



hypothetical protein jhp0042 



Description 



NT AA 

— , — , Score Probability 
Length Length 



FT7S~ 



Locus Name 



bir:H7i98i 



|4.1e-45 



Acc# 



H71981 



ORF Name 



3.3. 8.2.3.12.... tl....! 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



3T52T" 



10245 



TonB- dependent receptor HmuR 



Description 



TTT 



1.3e-05 



Locus Name 



gpTPGuS7395 



Acc# 



U87395 



Porphyromonas gingivalxs TonB -dependent receptor HmuR (hmuR) gene , complete 
cds . 



1297 



ORF Name 



NTID 



AAID 



NT AA „ _ , , . _ . . 
— — , Score Probability 
Length Length 



1038140 tl 2 



Protein name 



10246 



3288 



Locus Name 



Acc# 



Description 



ftTO-HlT 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Protein name 



10247 



£7TT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



10.9.7.S.5.a.7....ci...l2b.. 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



TUT 



TJTT 



Locus Name 



9.4e-30 



Acc# 



sp:MSCL_kkWc!A 



Description 

LAR(jE - CONDUCTANCE M E CHANOSUNSITIVU CHANNEL 



068284 



ORF Name 



imaiiiULi 



Protein name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



H0249 



~5T 



Locus Name 



Probability 



Acc# 



Description 



MO-HIT 



1298 



• 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



13017676 cl tib 



10250 



T7F" 



1.2e-07 



Protein name 



Locus Name 



trsl protexn (tral) 



|gp:Ali001272 



Acc# 



AE001272 



Description 



Lactococcus lactis tiK13l47 plasmid pMRCOX, complete plasmictsequence . 



NX 

— — Score Probability 
NTID AAID Length Length 

110251 



AA 



ORF Name 



Il375l250 c2 10b 



|3.Be-i6 



Protein name 



Locus Name 



hypothetical protein l 



[pir:140237 



Acc# 



140237 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



5030 



10252 



Length Length 




Score Probability 

121 .... 



0 . 00075 



Protein name 



Description 



Locus Name 



Acc# 



gp:T7CG 



Genome or r>acterxopnage TV . 



NT 



AA 



ORF Name 



NTID 



AAID 



143.1£3.3.3....cl...:^.. 



10253 



Length Length 
WTZ 



Score Probability 



140 



Protexn name 



Description 



Locus Name 



Acc# 



INO-HIT 



1299 



NT 



AA 



ORF Name 



NT ID 



16531331 c3 J^ti 



AAID Length Length 
W7V 



10254 



Score Probability 
0.00053 



Protein name 



Locus Name 



ras interacting protein kipa 



gp:AFlB^41 



Acc# 



AF159241 



Description 



Dictyostelium discoideum ras interacting protein RIPA (ripA) mRNA, complete 
cds . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



10255 



Score Probability 
7.3e-10 



Protein name 



Locus Name 



tetracycline resistance element mobilization 
regulatory protein rteC 



|pir:A36$27 



Acc# 



A36927 



Description 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



5im~ 



10256 



415 



TTT 



5.2e-06 



Protein name 



Locus Name 



clostripam-relatect protein 



|pir:£72ibi 



Acc# 



B72351 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



m5.413A...c2....11iA., 



5TF33 - 



10257 



Length Length 
25B 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



1300 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



24641647 c2 103 



1025a 



TTT 



SWT 



3.5e-46 



Protein name 



Description 



Locus Name 



Acc# 



sp:AQPZJ«!CoLl 



AOUAPOklM Z (BACTERIAL ^ubUhl^-LlKM m'1'K.M SIci ^koTiilM) 



NT 



AA 



ORF Name 



NTID 



AAID 



2464S261 c3 132 



10259 



Length Length 

[1137 



Score Probability 
1.2e-46 



Protein name 



Description 



Locus Name 



sp:YHCS_IiCoLi 



Acc# 



P45423 



HYPOTHETICAL 44 . i Kl) PROTLIHJ IN (jLTF-NANT INTUkcikNl O kUGluM (uiVb) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
T^l 



Score Probability 



10260 



T2TT 



Protein name 



Description 



Locus Name 



Acc# 



BT0-U1T 



NT 



AA 



ORF Name 



NTID 



AAID 



10261 



Length Length 
¥T7 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO -HIT 



1301 



ORF Name 



52317217 rl 4 



Protein name 



NT ID 



NT AA 

— — Score Prob abi lity 
AAID Length Length ■ 



10252 



143 



[llT- 



Locus Name 



Acc# 



Description 
IKIO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



12lll±2b....a±Jm I 



5041 



1026^ 



7TT3~ 



T7TT 



1.8e-13 



Protein name 



Locus Name 



immunoreactive 42KD antigen pujJ3 



|gp:APi7B7ib 



Acc# 



AF175715 



Description 



£>orphyromonas gingivalis strain W50 immunoreactive 42kD antigenPG3 3 gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10254 



75" 



0.025 



Protein name 



Locus Name 



elongation ractor ts 



gp:AF19byb2 



Acc# 



AF195952 



Description 



Phaeodactylum tricornutum ribulose-i , b-bisphospnatecarsoxyiase/ oxygenase 
large subunit (rbcL) , ribulose-1 , 5-bisphosphate carboxylase/oxygenase small 
subunit (rbcS) , and elongation factor Ts (EF-Ts) genes, complete 
cds;chloroplast genes for chloroplast products. 



ORF Name 



NTID 



NT AA 

— — , S core Probability 
AAID Length Length — 



10265 



Protein name 



Locus Name 



Acc# 



Description 
MO -HIT 



1302 



ORF Name 



3529293^ £3 



Protein name 



NTID 



AAID 



10266 



NT 



AA 



Length Length 
TT2 



Score Probability 



143 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



41I£.2.6.2...a2.„.lUb... 



Protein name 



NTID 



AAID 



10257 



DNA-binding protein, HU 



Description 



— — Score Probability 
Length Length 



Locus Name 



pir:H72Jy6 



l.le-12 



Acc# 



H72396 



ORF Name 



Protein name 



NTID 



AAID 



10268 



— — Score Probability 
Length Length 



WT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



10269 



DNA topoisomerase III topB 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



3IZ" 



Locus Name 



pir:k69724 



|2.0e-91 



Acc# 



H69724 



1303 



NT 



ORF Name 



'5250000 c2 109 



NTID AAID Length Length 



AA 

— Score Probability 



5"uTF 



[10270 



178 



14 . Oe-10 



Protein name 



Locus Name 



high molecular weignt glutenin summit: 



lgp:ASU:i922y 



Acc# 
U39229 



Description 

Aegxlops tauschn high molec ular weight giutenm sununit (Glu-l-2 ) gene, 
complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


58S5l87_tl_5 5049 


10271 


165 


498 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


fiSiaaa^ti»;iu 


5050 


10272 


69 


210 






Protein name 








Locus 


Name 


Acc# 


Description 














MO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±1M$M±±1...± 


5051 


10274 


163 


452 


183 


3.6e-14 


Protein name 








Locus 


Name 


Acc# 



unknown 



Description 



bp:AP04«74y 



Bacteroides tragilis capsular polysaccharide biosyntnesis operon, compieue 
sequence . 



1304 



NT 



AA 



ORF Name 



NT ID 



224694b2 ti 2 



AAID Length Length 




10274 



T7T" 



Score Probability 
I3.2e-13 



T7T" 



Protein name 



Locus Name 



"unknown 



bp:AP04a74y 



Acc# 



AF048749 



Description 



"Bacteroides fragi iis capsular poiysaccnaricle Biosynthesis operon, comp-Lete 
sequence . 



NT 



AA 



ORF Name 



NT ID 



34266^6 Cl 4 



AAID Length Length 
355 



Score Probability 



10275 



T2T 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



110276 



Length Length 

wn — 



Score Probability 
|5.5e-50 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



tpir:JCfe0ia7 



ACC# 



JC6027 



Description 



— — Score Probability 

Length Length 



ORF Name 



NTID 



AAID 



10277 



li.Oe-OtJ 



Protein name 



Locus Name 



|sp:H0L£ UAKlJsJ 



Acc# 



P43748 



Description 
DNA POLYMERASE 111, DlilL'l'A 1 dUBUNIT, 



1305 



ORF Name 



Protein name 



NTID 



1027a 



NT 



AA 



AAID Length Length 




Score Probability 



f7T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



±££A±±XL±2..±h... 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



10273 



T72~ 



Locus Name 



carboxyl- terminal proteinase 
ctpB: hypothetical protein slr02 57 : hypothetical 
protein slr0257 _ _ - 



Description 



|pir:ti74b7u 



0.00036 



Acc# 



S74579 



ORF Name 



lS..7.S.40....a2....l2. 



Protein name 



NTID 



STIFF" 



AAID 



10280 



NT 



Length Length 



AA 

— Score Probability 



T7T~ 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10251 



— — Score P robability 
Length Length — — 



3T" 



Locus Name 



Acc# 



Description 



INO-HIT 



1306 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



24648941 41 



5050" 



TUZZT 



1494 



8.7e~75 



Protein name 



putative TonB - dependent outer membrane 
receptor 



Locus Name Acc# 
gp:AJ?'04B749 | AF048745 



Description 

Sacteroides tragiiis capsular polysaccha ride biosyntnesis operon, complete 



sequence . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


2S78S15i_c3_i4 


5061 


10285 | 


P 1! 


183 








Protein name 








Locus 


Name 


Acc# 


Description 


















MO-HIT 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


2M5.1i:Ab....cl...2L6. 


5062 


10284 


65 


198 








Protein name 








LOCUS 


Name 


Acc# 


Description 


















NO-HIT 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


29..7.3.3.3..7A...cl...lb. 


5063 


10285 


931 


2796 




136 


0.00047 



Protein name 

hypothetical protein FfB0540w 



Locus Name 



] tP ir:D7161ir 



Acc# 



D71612 



Description 



1307 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



3145252 ci 24 



102 ST 



1.3e-06 



Protein name 



Description 



Locus Name 



Acc# 
Q03027 



ALKALINE PROTEAN ^CRBTIOM PROTEIN APk* 1 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
7W 



Score Probability 



10267 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



10288 



Length Length 



Score Probability 
u7W35 



53 



Protein name 



Locus Name 



putatxve H^P2 0 



|gp:AF071>87b 



Acc# 



AF072875 



Description 



Mycobacterium smegmatis putative HSP2 0 msp; gene, complete cas . 



NT, 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10229 



T3T" 



2 . 2e-0b 



Protein name 



Locus Name 



ORF MSV261 leucine ncn repeat: gene tamiiy I |gp : AF063866 



Acc# 



AF063866 



Description 



Melanoplus sanguinipes entomopoxvirus , complete genome. 



1308 



NT 



AA 



ORF Name 



NTID 



AAID 



10429557 t3 3 



10290 



Length Length 




Score Probability 



77 



Protein name 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



— — Score Probability 



|ilOJ.5.7.5...±l...A 



AAID Length Length 



10291 



1185 



3TT" 



|1.3e-54 



Protein name 



Locus Name 



115K outer membrane protein precursor : sus<J 
protein 



pir : JUfeU^V 



Acc# 



JC6027 



Description 



NT 



ORF Name 



NTID 



3u"7u~ 



AAID Length Length 



AA 

— Score Probability 



10292 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



±±&Ab.Zb;±..±l..J.O. 



10293 



1152 



[233" 



14.8e-20 



Protein name 



Locus Name 



conserved nypotneticai protein 



hpir:U72273 



Acc# 



H72273 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



|l5.^.7.b.fo.B.B...±l..l 



3u"7T 



10254 



Length Length 
T¥73 — 



Score Probability 
|3.1e-63 



Protein name 



Locus Name 



conserved hypothetical protein yngK 



|pir:H&9B93 



Acc# 



H69893 



Description 



1309 



ORF Name 




NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


21521937_clJ/0 


5073 


1029b 


301 


505 


231 


2.9e-19 


Protein name 










Locus 


Name 


Acc# 












sp:SCRK 


jSALTY 


P26984 


Description 
















FRUOTC^NASE, 1 


ORF Name 




NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


240234b7_t3_b2 




5074 


1029£ 


768 


2307 


373 


8.2e-31 


Protein name 










Locus 


Name 


Acc# 












sp : LEMA 


J>SE3Y 


P48027 


Description 
















SENSOR UkoTUlN 


LEMA, 










i 


ORF Name 




NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2&19l61L1...cl±JJA 




5075 


10297 


282 


849 


752 


i.8e-74 



Protein name 



Locus Name 



Acc# 



P14176 



Description 

GLYCINE IjETAlNE/L-^ROLlMJbJ TRANSPORT £i¥riT EM PlilRMklASE protein fkuW 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l±±l&2M....al..±&± 


5076 


10298 


285 


858 


453 


8.7e-43 



Protein name 



Locus Name 



Acc# 



giycine-betaxne Joxnaxng permease protein 



lgp:A^l39b7b 



AF139575 



Description 

Lactococcus lactxs BusAA (bus AA) and glycxne-betaxne Joxnaingpermease 
protein (busAB) genes, complete cds. 



1310 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



24644705 cl 7i 



12730 



|2.7e-55 



Protein name 



Locus Name 



hybrid nistictme Kinase 



gp:AJ ? , 02yV04 



Acc# 



AF029704 



Description 



ftictyostelium dis coideum hybrid histiame kinase (dhKD) mKNA, complete cds . 



ORF Name 



2464S432 ti 10 



Protein name 



OTTO 



NT 



AA 



NTID AAID Length Length 

10300 



Score Probability 



Locus Name 



Acc# 



Description 



BJ0-H11 1 



ORF Name 



Protein name 



tructanase 



Description 



NTID 



AAID 



10301 



— — Score Probability 
Length Length 



[T1W 



Locus Name 



tpir:A36S)lb 



1. ie-152 



ACC# 



A36915 



ORF Name 



26.5i9.7.13.6....cA...114.: 



Protein name 



Description 



NTID 



5080 



10302 



NT — Score Probability 



AAID Length Length 



TT7T" 



Locus Name 



sp:GLtlP_BRUAB 



ACC# 
Q44623 



1311 



NT 



AA 



ORF Name 



NTID 



2817217 cl l U 



AAID Length Length 




10303 



Score Probability 
I4.9e-i7 



2TT7 



Protein name 



Locus Name 



spiYBAklJUcJuLl 



Acc# 



P75707 



Description 

HYfrOlWl ' liJAL 14.4 Kb PRO'l'mM lU mS6-HMA IfrMkc^ie region 



ORF Name NTID AAID 


NT AA 

„. . — Score 

Length Length 


Probability 


3U'0S4532_ci_loo 50§2 10304 


2T2 1235 1028 






Protein name 


Locus Name 




Acc# 


ATPase nomoiog gjdua 


gp:AP0^y83b 




AF039835 


Description 










Listeria monocytogenes ATPase nomoiog 
membrane transport protein GbuB (gbuB) , 
protein GbuC (gbuC) genes, complete cds 


GbuA (gJDuA; , putative giycinejjeucixxit; 
and putativeglycine betaine binding 




ORF Name NTID AAID 


NT AA 

— Score 

Length Length 


Probability 


3.££S.llD.tf...±l...li i^'TOT 103 0b 


309 125 






Protein name 


Locus Name 




Acc# 


hypothetical protein APE2061 


pir:ti72blU- 




G72510 


Description 








ORF Name NTID AAID 


NT AA 

____ — Score 

Length Length 


Probability 


L1^±1JL'^± 5084 10306 


192 579 W2 




y . ue- uy 


Protein name 


Locus Name 




ACC# 



conserved nypothetical protein 



pir :G7bbbb 



Description 



1312 



ORF Name 



4712825 cl 88 



Protein name 



NTID 



10307 



— — Score Probability 
AAID Length Length — 



355" 



1197 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



— — , Score Probability 
Length Length 



10308 



11153 



3582" 



T7£!T 



TOT 



Locus Name 



pyruvate terredoxm oxidorecluctase 



| gp:CM1772 7 



Acc# 



Y17727 



Description 



Clostridium past eurianum genes encoamg putative pyruvate terreaoxin. 
oxidoreductase (8005 bp) . 



ORF Name 



NTID 



i0.3.14b.l...cZ...k.. 



TOFT 



10309 



Protein name 



hypothetical protein aq__i477 



Description 



NT 



AA 



AAID Length Length 




Score Probability 
i.8e-0& 



Locus Name 



|pir:D70428 



Acc# 



D70428 



ORF Name 



Protein name 



NTID 



AAID 



W3~ 



10310 



NT 



AA 



Length Length 
313 



— Score Probability 



Locus Name 



Acc# 



Description 



IMG-HIT 



1313 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



121562667 ti 4 



TOSS" 



10311 



TTO¥" 



2041 



4 . 6e-2ll 



Protein name 



Description 



Locus Name 



sp:TLVD_HAlillW 



Acc# 



P44851 



D 1 HYDROXY - AC ID DUkVDRATAai i l, (DAb) 



ORF Name 



NT ID 



237127SS t2 10 



TOTO" 



10312 



Protein name 



acetolactate syntnase, large summit 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TOT 



Locus Name 



|pir:B7236^ 



1 . Oe-126 



Acc# 



B72362 



ORF Name 



1±9.0.&.5.1&..±2J.L 



Protein name 



NTID 



TOST" 



rum 



NT 



AA 



AAID Length Length 
7TT5~ — 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



Protein name 



NTID 



— — Score Probability 



|3.£20.46.b.6...±0,...b. 



TOST" 



AAID Length Length 
T^5 



10314 



TO- 



ST" 



0.00SJ 



Locus Name 



Acc# 



capsici portal protein 



| gp:BlU32222 



Description 



Bacteriophage 186, complete sequence. 



1314 



NT 



AA 



ORF Name 



NT ID 



4822001 rJ 12 



AAIP Length Length 
— 



10315 



7TT 



Score Probability 
2 . 4e-20 



Protein name 



Description 



Locus Name 



gp:AF0^424 



Acc# 



AF083424 



Ateline nerpesvirus 3 complete genome. 



ORF Name 



'8678425 12 9 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



Length Length 
1154 



Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 
Length Length 



— Score Probability 



10317 



5T5" 



Locus Name 



Acc# 



MO -HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



ISAAl&ll^l^l | 



5TT^" 



AAID Length Length 



10318 



WT 



— Score Probability 
l.4e-UB 



TTT 



Locus Name 



sp:Y0B2_BokkU 



Acc# 



051081 



Description 
HYPOTHETICAL TkN A/ kWA MMTUtfL TRAMd^kASB 



1315 



ORF Name 



2442257 tl 1 



Protein name 



NT ID 



TOST 



— — Score Probability 
AAIP Length Length 



10319 



37T 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAIP Length Length 



Score Probability 



10320 



5.ie-22 



Locus Name 



sp:YM64_AkUFU 



Acc# 



028020 



HYPOTHETICAL ^RoTiail M AF2264 



ORF Name 



iiaimi...ajL„.^. 



Protein name 



NT ID 



AAID 



10321 



NT 



Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NT ID 



— — Score Probability 
AAID Length Length ; 



[5TW 



110322 



[T7T 



|5.5e-35 



Locus Name 



l sp;Y3fflHJAAl!aU 



Acc# 
P54947 



Description 

HYPOTHETICAL 30.2 KB PkOTElM IN IbH-bU OR IMTlalkcJEWIC kECjloM 



1316 



ORF Name 



NTID 



NT AA „ 

— — Score Probability 
AAID Length Length 



'6444402 i2 b 



148S 



7.0e-12i 



Protein name 



Locus Name 



cystemyl-tRNA syntnetase 



pir :A75368 



Acc# 



A75368 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



im5.<ni..±2L..A I 



110324 



SIT" 



Protein name 



Locus Name 



2,3,4,5- 1 e t rahyciropyr ictine - 2 - carboxylase 
N-succinyl transferase- related protein 



Description 



pir:H7224b 



4.3e-18 



Acc# 



H72245 



ORF Name 



12.3.*mb.2...±Z...b... 



Protein name 



NTID 



AAID 



10325 



NT 



AA 



Length Length 




Score Probability 



62 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10326 



NT 



AA 



Length Length 
TI2 



Score Probability 



T5 



Locus Name 



Acc# 



Description 



[NO-HIT 



1317 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



12944677 c2 b 



TUJTT 



RT5T" 



7.3e-35 



Protein name 

TonB- dependent receptor HmuR 



Locus Name 



Acc# 



U87395 



Description 

Porphyromonas gmgxvalis I'onJj- dependent re ceptor HmuR mmuRj gene, complete 
cds . 



ORF Name 


NTID 


AAID 


NT AA 

— Score 
Length Length 


Probability 


22S50^81__ci__4 


SlOS 


1032& | 


|77 ||234 




Protein name 






Locus Name 


Acc# 


Description 










NO-HIT 










ORF Name 


NTID 


AAID 


NT AA 
— — Score 
Length Length 


Probability 


xAii.A.HVl^JL 


"J 5107 


10529 


TTO 630 575 


" S.le-34 


Protein name 






Locus Name 


Acc# 


1 receptor antxgen tRagA) 




| gp: Ptill Jab 


AJ130872 


Description 




Porphyromonas gmgivaiis w^u 
immunodominant 55kDa antigen. 


receptor antigen (rag; locus enc 


ouxiiya met j ui 




ORF Name 


NTID 


AAID 


NT AA 

— Score 
Length Length 


Probability 




5108 


|10^0 


GB" 207 sr 


0.005b I 


Protein name 






Locus Name 


Acc# 


SOJcDa lectin 






| gprBMObOKDAL 


D14168 


Description 











Silk worm mRNA tor 50kDa lectin, complete cas. 



1318 



ORF Name 



NT ID 



12460837 t3 2 



10331 



Protein name 



adenylate cyclase nomolog 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



1845 



Locus Name 



pir:T17197 



9.3e-54 



Acc# 



T17197 



ORF Name 



143.£3.b.6.3....c.l...lU., 



Protein name 



NT ID 



10332 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT" 



ORF Name 



15.7A23.2i7...±l..^ 



Protein name 



NT ID 



Bill 



AAID 



10333 



NT 



AA 



Length Length 
TFT 



— Score Probability 



Locus Name 



Acc# 



Description 



NO-urr 



ORF Name 



Protein name 



NTID 



10334 



conserved nypotnetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



or 



Locus Name 



bir:(472J8U 



8. 7 e-2 7 



Acc# 



G72380 



1319 



NT 



AA 



ORF Name 



NTID 



AAID Length Length- 



Score Probability 



33772907 c2 IB 



TTT 



IT 



Protein name 



Locus Name 



HdcB 



gp:00DJ>Bttbb 



Acc# 



U58865 



Description 



Oenococcus oeni h isbidine decarboxylase indcA) gene, complete cas;ana Hacii 
(hdcB) gene, partial cds . 



ORF Name 



35267037 c2 16 



Protein name 



NT 



AA 



NTID AAID Length Length 

5114 | [10336 | |117 | |354 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



16A410.0.3...±i....i. 



Protein name 



NTID 



10337 



NT 



AA 



AAID Length Length 
T3"5 



Score Probability 



1ST" 



Locus Name 



Acc# 



Description 



BTO-HIT 



ORF Name 



Protein name 



NTID 



— — Score Pr obability 
AAID Length Length 



3TT6" 



1033S 



ST5~ 



TUZT 



I4.6e-l0tt 



Locus Name 



inorganic pyrophosphatase 



gp:D«B8^U 



Acc# 



D88820 



Description 

Acetabularia mediterranea mRNA tor inorganic pyropnospnatase , complete cas . 



1320 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10544500 c2 y 



FTTT 



TUTJT 



S.4e-31 



Protein name 



Description 



Locus Name 



IgpiPlGUl-'Mk 



Acc# 



M30284 



Pig uterotemn mRJNA, complete cds , 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


834333J:i_i 


5118 


10340 


163 


IS5 407 


" S.5e-38 


Protein name 








Locus Name 


ACC# 


1 hypothetical protein 






"| pir:S76bV2 


S76672 


Description 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


i&&:/.:/M..±z...i 


5119 


10341 


333 


1002 




Protein name 








Locus Name 


Acc# 


Description 












MO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


— , Score 
Length 


Probability 


113MVlix...c&..±tl 


5120 


10342 


300 | 


903 1105 





Protein name 



Locus Name 



neat snocK protein bu 



1 |gp:BF06bir 



ACC# 
AJ006516 



Description 

Bacteroides torsyt hus groiilL gene, strain atuc 43U.37. 



1321 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4022312 ci y 



10343 



2.0e-36 



Protein name 



Description 



Locus Name 



sp:CHlU_PORGl 



Acc# 



P42376 



10 Kti CJHAPERONIKJ (^ROTEM ciPMlO ) (PftOTfllN GRCE^) 



ORF Name 



110354667 ±2 lb 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



110344 



1 



[I2T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



144&6.a3.lL±l...b. 



Protein name 



NTID 



AAID 



1034$ 



NT 



AA 



Length Length 
B"53 



Score Probability 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



10346 



NT 



— — Score Probability 
Length Length 



£"0" 



Locus Name 



Acc# 



Description 
NO-HIT 



1322 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



10347 



Length Length 



Score Probability 



TFT 



Locus Name 



Acc# 



INO-HIT 



ORF Name 



±9£±l&b:.L±±J.L 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



10348 



Length Length 
ZT5 



— Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



5127 



10343 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



| 215.BA0.b.7...±1 ...10. I 



NTID AAID Length Length 

11 0350 | pr 



122 8 



Score Probability 
"UTOTS 



[7T 



Locus Name 



|sp.:UCRH_yiiAiiT 



ACC# 



P00127 



Description 

(MITOCHONDR I AL UlUtiE PkOTUlM) (COMPLEX I II PohVPHP'l'lDl! VI) 



1323 



NT 



ORF Name 



24024067 ri b 



NTID 



AAID Length Length 



AA 

— , Score 



10351 



TIT 



Probability 
|2.5e-2a 



Protein name 



Description 



Locus Name 



| gp:B F U6^096 



Acc# 



U63096 



Bacberoid es rragxlis (bctA) gene, complete cds . 



ORF Name 



NTID 



AAID 



110352 



Protein name 



conserved hypotnetical protein 



Description 



NT 

Length Length 



AA 

— , Score 



Probability 
5.4e-27 



Locus Name 



1 (pir:fl723BO" 



Acc# 



G72380 



ORF Name 



Protein name 



NTID 



AAID 



10353 



NT 

Length Length 



AA 

— , Score 



TUT 



Locus Name 



Probability 



Acc# 



Description 



[NO-HIT 



ORF Name 



25.mJ....cl...b.4.. 



Protein name 



NTID 



5132 



AAID 



10354 



NT AA Score 
Length Length ' 



7T 



TTT 



Locus Name 



Probability 



Acc# 



Description 



MO-HIT 



1324 



# 



ORF Name 



2544267b rl 4 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



"ZTJT 



Length Length 




Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



10356 



Length Length 
TW3 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NTID 



AAID 



[10357 



— — Score P robability 
Length Length — 

— 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



10358 



Length Length 
F£33 



Score 



T5T 



Locus Name 



Probability 



Acc# 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



— Score Probability 



5TTT 



103S9 



0 . 045 



Locus Name 



H+- transporting ATP synthase, protein b 



pir:Tlll2l 



Acc# 



T11121 



Description 



1325 



ORF Name 



NTID 



26854156 tl 2 



513S 



10360 



Protein name 



hypothetical protein HOiiFuy.i 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



11647 



Locus Name 



IpiriT^fey 



7.1e-06 



Acc# 



T33369 



ORF Name 



3.2ufiAM3....cl^....... 



Protein name 



NTID 



110361 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



|3.2.0.6.^b.ai..±i..Ab.., 



Protein name 



NTID 



10362 



ATP syntnase, suounrt f 



Description 



NT 



AAID Length Length 



AA 

— Score Probability 



pur 



FT5" 



Locus Name 



J [pir:kfo^22T 



0.014 



ACC# 



H69227 



ORF Name 



Protein name 



NTID 



10363 



Description 
HYPOTHIjI ' 1' I 0 AL 15. 7 KB PROTEI N (Okl'6)" 



NT 



AA 



AAID Length Length 



Score Probability 



[T7T 



!T3T~ 



TUF" 



Locus Name 



lsp:YPI6_CJLOPli! 



5.4e-Ob 



ACC# 



P18017 



1326 



NT 



AA 



ORF Name 



NTID 



14409433 c3 yb 



AAID Length Length 
FT55 — 



10364 



Score Probability 
0.049 



85 



Protein name 

0&F MSV223 hypotnetxcax protein 



Locus Name 
bp:AP06386b 



Acc# 



AF063866 



Description 

"Melanoplus sanguinipes entomopoxvirus , complete genome. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


S3l463__cl_56 


5143 


103£5 


217 1 


£54 




Protein name 








Locus Name 


Acc# 


Description 












MO-HIT 










i 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


21610.1-2.^1^5. 


.... 5144 


1036^ 


204 


615 141 


9.2e-0S 


Protein name 








Locus Name 


Acc# 


integrate intNl 








"J gp:BtfU51917 


U51917 


Description 




Bacteroides unitormis insertion element JNtiux 


tragment, incegraseinuu ycuc, 




complete cds . 














ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


Zll±btlb.txJk:±J<L 


.... 514 b 


10367 


71 


rrs 52- 


0.01b 


Protein name 








Locus Name 


Acc# 










sp:"<S3P_SC!HMA 


P20287 


Description 


'rPT ^fffKf \ — nr. 








1 



1327 



ORF Name 



NT ID 



4103388 C3 9 



Protein name 



neuroendocrine protein 7B2 



Description 



— — Score Probability 
AAID Length Length 



IT 



62 



Locus Name 



Ipir^O^yiB 



0.0027 



Acc# 



S03938 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&9£M25.».a±.±0. 


5147 


10369 


150 


450 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


52£51ktt...c:L..A 


5148 


10370 


|72 


219 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


S.l££.5M...alJI 


5149 


10371 


205 




276 


l.le-23 


Protein name 








Locus 


Name 


Acc# 



methyl transferase 



| |gp : STRMTk" 



L29323 



Description 

Streptococcus pne umoniae methyl transferase gene cluster, compietesequence , 



1328 



NT 



AA 



ORF Name 



NTID 



AAID 



5150 



10372 



Length Length 




Score Probability 
3.8e-24 



Protein name 
Description 



Locus Name 



sp : BGAL__HUMAN 



Acc# 



P16278 



NT 



AA 



ORF Name 



NTID 



cl 4 



AAID Length Length 
553 



Score Probability 



10375 



TTT 



Protein name 



Description 



Locus Name 



Acc# 



|N0 -HIT 



ORF Name 



NTID 



AAID 



l3.0.aS16.Z..±2....1i.. 



10374 



— — Score Probability 
Length Length — 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT ' 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1037S 



l . 3e- -34 



Protein name 



Locus Name 



Jaeta-glucosidase 



IgpikAUy^BUy 



Acc# 



U92808 



Description 

"Ruminococcus albu s beta-glucosidase (gluA; mRNA, complete cas. 



1329 



NT 



AA 



ORF Name 



NT ID 



24644b7b cl 33 



AAID Length Length 
WTV 



Score Probability 
|2.6e-i8 



Protein name 



Description 



Locus Name 



IspiYIBJMSCoLl 



Acc# 



P37690 



mOTHhi'l'lCAL 46.6 Kb J^OTBIfl IN ^cJB-l'bH INTl^ BfllC KdciluN' 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length — 



4476427 cl 34 



15155 



10577 



TTOTT 



3.3e-26 



Protein name 



Locus Name 



' bZIP histiaxne Kinase 



1 |gp:PI>UyiB^r 



Acc# 



Y18245 



Description 



Pseudomonas putida todX, fcodi? 1 , todUl, toad, coqb, cooa, 
todl, todH, todS, todT genes. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TUT 



2.3e-19 



Protein name 



Locus Name 



response regulator urrA 



pir :T> l J222B 



Acc# 



D72228 



Description 



ORF Name 



Protein name 



Description 



NTID 



10379 



— — Score Probability 



AAID Length Length 



914 



Locus Name 



IsptBtiLtTAakTU 



1.2e-yi 



Acc# 



P27034 



1330 



NT 



AA 



ORF Name 



NTID 



AAID 



10251900 cl 94 



Length Length 
^1 



Score Probability 



TCB" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



10381 



Length Length 




TST" 



Score Probability 
i.7e-23 



\T7T 



Protein name 



Description 



Locus Name 



Acc# 



AC006202 



Arabidopsis thaliana chro mosome II SAC T3b23 genomic sequence, complete 
sequence. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TUT&T 



585 



0.6622. 



Protein name 



Locus Name 



hypothetical protexn c04040 



|pir:S75406 



Acc# 



S75406 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



110383 



— l , S core Probability 
.3 .9e-1.0 



Protein name 



Locus Name 



lsp:YS23_MY<JTU 



Acc# 



P71786 



Description 

HYPO T HETICAL 2 V . 1 KB PkOTElM (11211.2 90 



□ 



1331 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



12687926 ti lb 



Protein name 



10284 



FTTT 



1508'" 



Locus Name 



2.2e-133 



Acc# 



Description 



sp;UXUA_UAiillN 



P44488 



MAMNONATE DEHVDMl'A^E, (D-MMNoN ATO HVbkOLA^) 



ORF Name 



NTID 



NT AA 

— — Score Prob abi l ity 
AAID Length Length 



145863-87 c3 163 



Protein name 



5TST 



10385 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



1.7.8.40.£...tl...iy... 



Protein name 



NTID 



5164 



AAID 



1038£ 



NT 



AA 



— — Score Pro bability 
Length Length 

355 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT 



AAID Length Length 



AA 

— Score Probability 



2.0.5.3.i:/.6.a..±i....b.^.. 



Protein name 



10387 



7^" 



Locus Name 



6 ,2e-28 



Acc# 



dihyarodipicoiinate reductase 



Description 



pxr :A72246 



A72246 



ORF Name 



|2L18.9.7.1S.2..±L..M„ 



Protein name 



NTID 



10388 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



3.8e-l6 



ACC# 



hypotnetical protein V 



Description 



pir:S20799 



1332 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


222527b_c2_I20 


5167 


10389 


51 


185 






Protein name 








Locus 


Name 


Acc# 


Description 
















WO -MIT 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


mais&i..±a...!*H 


5168 


10390 


411 


1236 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT 












i 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


22aS15.2L.±I...6. 


5169 


10391 


528 


1587 


1117 


3.8e-ii^ 



Protein name 



Locus Name 



hypothetical protein mex* 



pir :T3UbJU 



Acc# 



T30830 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



FT7U" 



Length Length 
^5 



Score Probability 
i.5e-6t* 



Protein name 



Locus Name 



oxidoreductase 



Igp : NO^HkMA 



Acc# 



L37087 



Description 



" Kfostoc sp. kTCC 29133 oxi doreductase (hrmU) and HrmA tnrmA) genes , complete 
cds . 



1333 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


24225682_tl_5 


5171 


10292 


294 


1185 




493 


5.0e-47 


Protein name 








Locus 


Name 


Acc# 










sp:YHIU 


JICOLl 


P37636 


Description 
















fftfiCOftiJoft | 


ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


245096S0_±2_40 


5172 


10294 


312 


939 




152 


7.8e-16 



Protein name 



Locus Name 



prokaryotic type I signal peptidase sipj?" 



l gp:AF06blb9 



Acc# 



AF065159 



Description 

Bradyrhizobium J aponicum putative aryisuitatase tarsA) , putativesoiuDie 
lytic transglycosylase precursor (sltA) , dihydrodipicolinate synthase (dapA) , 
MscL (mscL) , SmpB (smpB) , BcpB (bcpB) , RnpO (rnpO) , RelA/SpoT homolog (relA) / 
PdxJ (pdxJ) , andacyl carrier protein synthase AcpS (acpS) genes, complete 
cds; prokaryotic type I signal peptidase SipF (sipF) gene, sipF -sipSallele , 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 

— ; Score 
Length 


Probability 


2A&151&1.11..J. 


5173 


10395 


491 


1475 503 


4.4e-48 


Protein name 








Locus Name 


Acc# 



OprM 



lgp:AS0il3Bi 



AB011381 



Description 

Pseudomonas aeruginosa gene tor OprM, complete cas. 



ORF Name 



NT ID 



NT AA 
— — Score 
AAID Length Length 



"STTT 



10396 



TIT 



Probability 
|5.9e-21 



Protein name 



Locus Name 



conserved hypothetical protein 



bir:H7241V 



Acc# 



H72417 



Description 



1334 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10597 



3TT 



9 . 8e-85 



Protein name 

hypothetxcai protein mexj?' 



Locus Name 



pir:Ti08JU 



Acc# 



T30830 



Description 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



TTT3W 



8.5e-29 



Protein name 



Locus Name 



phosphoglycolate pnospnatase igphj nomoiog I ipir :C/ui84 



Acc# 



C70184 



Description 



ORF Name 



NTID 



— ■ . — Score Probability 



ET7T 



AAID Length Length 



10599 



3TT 



1353" 



2.7e-32 



Protein name 



Locus Name 



polysialic acid capsule expression protein I |pir :B7U434 



Acc# 



B704.34 



Description 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



26&l&£i)A..±'^l 



10400 



T3TT 



Protein name 



Locus Name 



beta-galactosidase 



gp:AP0bb482 



Acc# 



AF055482 



Description 

T hermoboga neapoli tana galactose utilization operon, compietesequence . 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



iflftiiai„.ai...ai 



10401 



T35T 



Protein name 



Locus Name 



Acc# 



Description 



NT 



AA 



ORF Name 



NTID 



335*>3y63 c2 136 



AAID Length Length 
110402 



Score Probability 



74 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



nfi2££5a..±2...aa...,. I 



AAID Length Length 
FTS 



10403 



T7T 



Score Probability 
8.7e-43 



Protein name 



Description 



Locus Name 



Acc# 



sp:LPXAJ<!<JoLl 



(EC 2.3.1.125) (U3D^-N-ACETYL^Ltfg05AMlNE AC V L'l'kAN S FERA& E ) 



ORF Name 



NTID 



— — , Score Probability 
AAID Length Length — 



10404 



i.3e-4S 



Protein name 



Description 



Locus Name 



Acc# 
Q04805 



HYPOTHETICAL PROCE SSING PROTEAN, (OkFP) 



NT 



AA 



ORF Name 



NTID 



!44M7.12..±1...14 



AAID Length Length 

tztj — 



10405 



Score Probability 
|1.5e-iSi 



T7T 



Protein name 



Locus Name 



lsp:LEP__3ALTY 



Acc# 



P23697 



Description 

SIGNAL PEPT I DASE 1, (S2ASE I ) (LEADED PEPTIDASE 1) 



1336 



ORF Name 



549091 c3 174 



Protein name 



# 



NT ID 



AAID 



10405 



NT 



AA 



Length Length 




Score Probability 



TIT 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10407 



T7T 



TTTT" 



14 .8e-26 



Protein name 



Locus Name 



protein- tyrosine pnospnatase 



Acc# 



AB028630 



Description 



Clostridium pertringens hyp27, JoacH , ptp, cpd genes tornypotneticai 
protein, bacterial hemoglobin, protein- tyrosinephosphatase , 2 ! , 3'-cuclic 
nucleotide 2 ' -phosphodiesterase , partial and complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


fiaimLcLiiia 


5186 


10408 


214 


545 






Protein name 








Locus 


Name 


Acc# 


Description 
















NO-HIT 


















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


bAl$&^tlJ.ll 


5187 


10404 


485 


1455 


880 


4.9e-88 



Protein name 



Locus Name 



AT£- dependent RNA neiicase nomolog ycifcR 



pir:D64772 



Acc# 



D69772 



Description 



1337 



■ — — Score Probability 
AAID Length Length 

110410 



AA 



ORF Name 



NTID 



12402186 ci 12 



Protein name 

Description 
MO-HI T 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10411 



T75" 



1128" 



[4.5e-32 



Protein name 



Locus Name 



histiciine Kinase 



|gp:AP114442 



Acc# 



AF114442 



Description 

"Kfostoc punctirorme histiciine Kinase (nepK) gene, complete cds. 



ORF Name 



NTID 



AAID 



10412 



— — Score Probability 
Length Length — 

T75 



2.6e-26 



Protein name 



Locus Name 



2, 3 -bisphosphoglycerate- independent: 



lgp:APi2ooyi 



ACC# 



AF120091 



Description 



Bacillus stearotnermophilus 
2,3-bisphosphoglycerate-independentphosphoglycerate mutase (pgm) gene, 

complete cds. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


^BbALb!l.±^X 


5191 


10413 


186 


561 


212 


3..0e-lV 


Protein name 








Locus 


Name 


Acc# 










sp:YP20, 


_BACLI 


P05332 


Description 














HYPOTHETICAL 


PROTEIN 










i 



1338 



ORF Name 



11777161 65 



Protein name 



NT ID 



AAID 



10414 



NT 



AA 



Length Length 
£3 



Score Probability 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



10415 



NT AA „ _ , _ . . 
— — • , Score Probabxlxty 
Length Length 



2W 



Locus Name 



i.9e-24 



Acc# 



hypothetical protein MTH1452 



Description 



pxr:b69060 



D69060 



ORF Name 



16.6.7.6.5.Ub....a2...6.2. 



Protein name 



NTID 



AAID 



10415 



NT AA 

— — • , Score Pro bability 
Length Length 



TUT" 



Locus Name 



2.7e-i<5 



Acc# 



probable hydrolase 



Description 



|pir:T37122 



T37132 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10417 



HIT 



TUTT 



|3.5e-103 



Protein name 



Locus Name 



Acc# 



hypothetical protein 



pir : JQ1020 



JQ1020 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



19£±0£.ll.±L..b. I [5T^ 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



5157 



10419 



429 



1290 



|4.2e-18 



Protein name 



Locus Name 



Acc# 



DNA damage- inducible protein. PAB1438 



pir:C75053 



C75053 



Description 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



2.2.&£0±2&..±1...&.. 



Protein name 



Description 



5TW 



10420 



10.031 



Locus Name 



sp : SPRC_XMMLA 



Acc# 
P36378 



(OS T EONEC T IN) (ON) (BASEMENT MEMBRANE PROTEIN BM-40) 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Protein name 



10421 



75" 



0.039 



Locus Name 



Acc# 



TAA~ 



Description 



|gp:AC005S6S 



AC005565 



Homo sapiens cnromosome lb, cosmid clone 444By (LANL) , compietesequence . 



ORF Name 



NTID 



NT AA „ _ , , . _ . , 
— — , Score Probability 
AAID Length Length — 



10422 



Protein name 



TTT 



Locus Name 



Acc# 



Description 



BTO-HIT 



1340 



ORF Name 



24415502 t2 IB 



Protein name 



NT ID 



10425 



# 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



BITOT" 



110424 



277 



PT7T 



Protein name 



Locus Name 



7-alpha-hyaroxysteroid ctenyarogenase 



Acc# 



AF173833 



Description 



Bacteroides tragilis 7 -alpha- hydroxys teroid dehydrogenase (hdhA)gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



\lll26.^L.±l...lk I 



AAID Length Length 
7T~ 



10425 



Score Probability 
0.021 



Protein name 



Locus Name 



hypothetical protein cubiuw 



|pir:Tia460 



Acc# 



T18460 



Description 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



\lS£M25A...al...b:.L 



10426 



Protein name 



Description 



Locus Name 



Acc# 



1341 



# 



NT 



AA 



ORF Name 



NTID 



^6J0bJ4y ci 44 



5205 



AAID Length Length 
10427 



5TT 



[T7T 



Score Probability 

rsrs — 



A.Se-10 



Protein name 



Locus Name 



sp : FEOB___METJA 



Acc# 
Q57986 



Description 

l^kkoUfcJ IRON TRANSPORT PROTEIN B MOMOLOG 



NT 



AA 



ORF Name 



NTID 



AAID 



25555517 c2 59 



10425 



Length Length 



RT7T" 



Score Probability 
T(T2 



Protein name 



Locus Name 



hypotnetical protein SCI30A.19 



pir :T36799 



ACC# 



T36799 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3.6.15.D.2..7..7...±1...S I 



10429 



Length Length 



W5T 



Score Probability 
6.9e-I5 — " 



Protein name 



Locus Name 



transcription regulator AraC/XylS ramily 
homo log ydeE 



pir:G69777 



Description 



Acc# 



G6 9.777 



NT 



AA 



ORF Name 



NTID 



I4S.5.3.3.S.7....C2...B.S. I \ZZU$ 



AAID Length Length 
10430 



Score Probability 
|2.4e-24 



Protein name 



Locus Name 



sp:YUXK_BACSU 



Acc# 



Description 

HYPOTHETICAL lS-7 KB PROTEIN IN £B£D-C0MA INTERMENT C REGION (0RF2) 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length • * 



60494S2 r3 35 



10431 



2¥5" 



57T 



l.Ve-55 



Protein name 



Description 



Locus Name 



sp:UNGJKJMAN 



Acc# 



P13051 



U£>ACIL-DKfA (jLVCO^VLAgfl MtiaKflmSOft, (tJDg) 



NT 



AA 



ORF Name 



NTID 



16053437 c3 71 



AAID Length Length 
— 



10432 



Score Probability 




2.2e-§7 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir:JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



5211. 



10433 



Length Length 
TT75" 



3T8" 



Score Probability 
Ml 



7..8e-42 



Protein name 



Locus Name 



hypothetical protein 



pir : JQ1020 



Acc# 



JQ1020 



Description 



ORF Name 



6.a3..7.8.3..7....c.2...6.a.... 



NTID 



Protein name 

Description 
HYPOTHETICAL PROTfira HI1477 



AAID 



10434 



NT AA 

— , - — , Score Probability 
Length Length — 



ITT 



Locus Name 



sp:YTFE_HAli!lN 



5.5e-25 



Acc# 



P45312 



1343 



NT 



AA 



ORF Name 



124417250 t!i 7 



NTID 

— 



AAID Length Length 



Score Probability 



1043b 



7T 



Protein name 



Locus Name 



OmpK3 7 porin 



gp:KPN011502. 



Acc# 



AJ011502 



Description 



Klebsiella pneumoniae (strain SD8J ompK3 7 gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



S725132 ±3 6 



10435 



Length Length 




Score Probability 




3 . le-12 



Protein name 



Locus Name 



Acc# 



colicin I receptor 



gp:EC0CIR 



Description 



E.coli colicin I receptor gene, complete cds. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length i - 



10437 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



3.ZlS.yAZ...r.l...l,.. 



110438 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



1344 



• 



NT 



AA 



ORF Name 



3440S328 ±2 4 



NTID AAID Length Length 
^TD 1 110439 1 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



x — " — _ Score Probab ility 
AAID Length Length ■ L 



10440 



TIT" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



1 2l 3> 2> 3> 2> • > > t !<• >1 ■<■■■> ■■•■>• !• 



AAID Length Length 
10441 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



21119.11.±±J1 1 [SZZU 



NTID AAID Length Length 

110442 



TOTT 



Score Probability 
- 



,1.4e-bl 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6 02 7 



Description 



ORF Name 



NTID 



AAID 



3.5.3.5.15.&3....C.1...3. I 



10443 



Protein name 

Description 
NO-HIT 



NT 



AA 



Length Length 
71 



Score Probability 



Locus Name 



Acc# 



1345 



NT 



AA 



ORF Name 



NTID 



AAID 



10444 



Length Length 
?TZ5Z — 



Score Probability 
5.5e-07 



Protein name 



Locus Name 



conserved hypothetxcal protein yknZ 



pir :K6S?bby 



Acc# 



E69858 



Description 



1346 



